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Understanding Math and Science 


To be numerate means to be acquainted with the basic principles of mathematics 
and how they are employed to describe and interpret information. 


To be scientifically literate means to have a broad understanding of how science 
works, what science is and what it isn’t, and to be familiar with knowledge gleaned 
from science. We live in an age of scientific discovery and scientific issues are the 
subject of many important debates in society today. Every citizen is faced with issues 
whose discussion requires some scientific background. 


The Oxford Illustrated Dictionary defines science (in part) as: “systematic and 
formulated knowledge, pursuit of this, principles regulating such pursuit;” “dealing 
with material phenomena based mainly on observation, experiment and induction,” 
“the systematic study of the phenomena of the material universe and their laws” 


“Scientia” is the Latin word for knowledge. Science is all about knowing: 
knowing ourselves, knowing our environment, and the relationships between the two. 
Science is the quest for knowledge and understanding; scientific knowledge and 
understanding are based on constantly observing and examining nature. 


The laws of the physical world are not random; the universe’s behaviour is not 
random - collecting and analyzing observations — performing science according to a set 
of principles and rules allows us to uncover the laws of nature and thus make 
predictions! 


Science, then, begins with the presumption that there is a set of laws that govern 
the universe and that through carefully considered application of logical thought 
processes, these laws can be understood. Just because we do not understand or know 
everything about the behaviour of the physical universe does not mean that there is no 
predictable pattern nor does it mean there are unknowable things. The collective body 
of knowledge gleaned from scientific activity has grown from ancient times, continues 
to grow and there is no reason to think it will not continue to grow indefinitely into the 
future. The logic of modern science requires that data (measurable, objective 
observations) are the basis for the validity of theories. Assertions are made based on the 
evidence and on sound logic applied to interpret the evidence. Scientific theories are 
derived from fact — not the other way around! 


It is important to realize that science isn’t technology. A common mistake is to 
confuse science with technology, and so science often receives undue credit or undue 
blame for many things. When speaking of the miracles of modern science we should be 
speaking of the miracles of modern technology. For instance, scientists developed the 
understanding and knowledge of the principles governing radiation that allowed for 
the invention of the modern microwave oven. Inventing a microwave oven is not 
science; it is a technological development. Science discovered the workings of nuclear 
physics; technology invented the nuclear bomb and human beings employed with 
devastating consequences. Scientists’ work is the generation of knowledge and 
understanding; engineers’ work is the business of generating technology — applying 
scientific knowledge to certain purposes, solutions to problems. Scientists often use 
extremely sophisticated technology in their experiments and in the analysis of data, but 
science does not require technology. Science uses technology and can lead to the 
development of technology, but science isn’t technology and it most certainly isn’t the 
use of technology. 


It is also important to realize that science isn’t “truth”. People often assume that 
scientists generate a body of knowledge that is sure to be “true”. Some ideas are known 
with enough certainty that most of us taken them for granted and regard them as ‘true’. 
We consider them true in the absence of any evidence to the contrary, despite the 


collection of massive amounts of observations. The common assumption that the earth 
orbits around the sun is taken to be true because there is a great deal of evidence that 
supports that position. However, it is just a theory — no human has ever physically 
observed the solar system and actually seen the earth travel around the sun. Thus it is 
just a theory — albeit a highly robust one due to the weight of evidence. 


Scientists do seek to find what is true, but they do not generate truth, or even 
certainty. They generate knowledge. They propose and test theories that are based on 
evidence, knowing full well that future evidence may result in these theories being 
refined or even rejected and ultimately replaced. This does not mean the original theory 
was “wrong”; if the science was sound, it means that the theory was a legitimate result 
of the observations and their interpretations at that time. When scientists speak about 
an issue, they will often use language such as “The evidence suggests that....” or “Our 
current understanding is....” None of this language indicates waffling or indecision, 
rather it should be taken as a sign of conscientious attention to the nature and methods 
of science. We only know the observable evidence; we reach the best possible 
conclusion based on the evidence at hand. Advances in technology can allow collection 
of ever-more nuanced observations and greater evidence that can result in refinement or 
replacement of theories. 


Science is almost always communicated at least in part using mathematical 
language. The word mathematics is derived from the Greek word mathematikos, which 
means “inclined to learn”. So, to be mathematical means to be interested in learning! 
Mathematics is not empirical. Science is. For scientists, mathematics is often considered 
a ‘language’, and gives us way of modeling the physical world. Science uses 
mathematics and so scientists need an understanding of how mathematics works to 
have an understanding of how it is applied. Without a fundamental understanding, we 
cannot properly use math to represent and analyze information and we will come to a 
flawed understanding of the physical world. The study of mathematics itself is another 
matter and pertains to the body of knowledge and reasoning about numbers, spatial 
forms, logic and the relationships between them. Mathematics is applied to 
measurement and prediction in science. Traditional mathematics courses focus on one 
field/branch at a time and are concerned with the comprehensive analysis of the rules 
and applications of that particular field. Some of these are: 


arithmetic: “the art of computation by figures” in other words, methods for 
operating on numbers 


algebra: “investigates the relations and properties of numbers using symbols” 
these symbols represent things, so algebra is a way of working with unknown (or 
variable) quantities of things. “Things” can be physical objects or they can be 


measures physical quantities or physical properties. “Things” can a set of 
observations, of data 


geometry: “investigates the properties and relations of magnitude in space as 
lines, surfaces or solids” geometry allows us to analyze and understand 
relationships between measurements (observations) of size, shape, distance, area 
and volume 


trigonometry: “deals with the measurements of sides and angles of triangles” 
and allows us to uses the properties of triangles to understand relationships in 
the physical world. 


calculus: “a system or method of calculating” The rules of calculus allows us to 
study and understand quantities that change over time or over space by 
representing behaviour of things using mathematical functions 


statistics: deals with the “collection, classification and discussion of facts bearing 
on the condition of a state or community” Statistics gives us the tools for sound 
collection and analysis of data and for identifying and testing the strengths of 
relationships amongst various sets of observations. 


probability: the rules determining “the extent to which a particular event is 
likely to occur”. Probability is the formal study of chance; when understanding 
the physical world through collected data and its analysis, it is important to 
consider all possible outcomes and be able to extrapolate the chance of a given 
outcome in a single experiment or observation to a distribution of a set outcomes 
or measured values when extremely large numbers of experiments or 
observations are taken. The probability of a single coin landing heads-up in a 
single flip is mathematically related to the % of heads that is obtained when 
thousands of coins are flipped thousands of time. Nature provides us with 
distributions of values of physical properties; different properties have their own 
unique distribution related to the probability of a single measurement being a 
specific value. Understanding this is vital to having a solid scientific 
understanding of natural phenomena. 


logic: “the branch of philosophy that treats the forms of thinking in general, and 
more especially of inference and of the scientific method”; the study of the 
principles of reasoning (deductive and inductive). 


(All quotes from the Oxford Illustrated Dictionary, online) 


You have, no doubt, varying degrees of familiarity with all of the above branches 
- many of these you encounter & use in every-day life while others are rather more 
esoteric and may be something you spend little time contemplating, much less using. 
Nevertheless, our scientific understanding of the world, and the vast benefits we derive 
from that understanding would not be possible without mathematics. Mathematics will 
be used throughout this course and you will gain a working knowledge of several of the 
mathematical disciplines given above. This course will introduce you to the physical 
sciences and the biological sciences through the study of earth sciences, natural 
disasters and microbiology. You will encounter different branches of mathematics and 
their use throughout. You will learn to understand and apply different mathematical 
techniques to aid in your understanding of different scientific topics. 


It is important to realize that there is no one single mathematical branch that can 
be used exclusively to do science; the pursuit of knowledge via the methods of science 
makes use of all branches of mathematics. Each discipline in science may rely more 
heavily on certain branches of mathematics than on others; without all of mathematics, 
though, we could not do all of science. 


Given the above discussion, we can re-state our definition with more precision: 
the primary goal of numeracy and scientific literacy is to develop the quantitative 
reasoning skills to be an informed citizen in a complex and ever-changing world. 
Quantitative reasoning combines mathematical skills with the ability to approach 
information (i.e., evidence), develop and appreciate knowledge and approach problems 
in a critical and analytical way. Having some familiarity with the ideas of critical and 
logical thinking will help develop that ability. 


People “believe” all sorts of things not because they have seen and understood 
the science behind something, but because they have faith that what they read or what 
they hear even in casual conversation must be “true”. For example, many people 
believe aspartame causes brain cancer, vaccines cause autism, goji berries cure cancer, or 
that fluoride in drinking water is a health hazard. These - and many, many other issues 
of the day do not have to be taken on faith; they are not transcendental, they can be 
tested by examining the evidence with a critical mind. Fora variety of reasons, many 
things are hard to properly investigate but they can be put to the test and confirmed, or 
dismissed. If things cannot be tested, argued scientifically according to the rules of 
logic, scientists are not talking about them. 


Chapter 1. Measurement and Units 


What are units for? Where do they come from? Why are they important in 
science? Science is concerned with collecting observations — therefore we need 
objective, commonly agreed to references for describing the data we collect. A number 
without units is a quantity — a number with units is a quantity of something, 
something that scientists are observing. With units there is unambiguous description, 
manipulation and interpretation of observations. Further, when units are standardized, 
there is no quarrel, no other possible interpretation. Thus, without units there can be no 
science; without units we cannot make interpretations and predictions of any part of 
our physical universe — from understanding nuclear energy, to plate tectonics and its 
manifestations and consequences, to our electricity bills, to the growth or depletion of 
our savings in our bank accounts. 


The various branches of mathematics are applied to these quantities-with-units 
in order for us to understand the world around us, to see relationships and to be able to 
see lack of relationships clearly; this understanding allows us to make numerous 
properly-informed decisions, from the rather trivial decision of which breakfast cereal 
to purchase on a given shopping trip to the more consequential decision of whom to 
vote for when candidates have different position on a topic and each claims theirs is the 
“true” position, to the possible life-or-death decision about whether to seek medical 
treatment from conventional or alternative medicine. 


1.1. Units, Unit Systems and Conversion Factors 


In order for them to be meaningful, measurements of physical quantities must be 
expressed as multiples of commonly agreed to units. Units are the standards, the 
reference point for our measurements. The same length can be expressed as 10 inches, 
0.833 feet, or 25.40 centimeters. The units ‘inches’, ‘feet’ and ‘centimeter’ refer to 
different standard lengths. Historically, units were defined to suit the immediate needs 
of the person or community making the measurements. Units of length varied from 
country to country, town to town and even from one profession (tailoring) to another 
profession (carpentry). Most units in use were loosely defined in reference to a body 


part — a convenience since a person carried around their measuring devices everywhere. 


Thus, an ‘inch’ was a thumb-width; a ‘foot’ was the length of the British king’s foot; a 
‘fathom’ (used to measure ocean depths) was defined as the distance between the 
finger-tips on opposite hands when the arms are outstretched in a straight line. 
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In the late 1700°s, the French Academy of Sciences recommended that for 
consistency an international standard of length be adopted and that unit of length 
should be based on the size of the earth. In other words, there should be one physical 
reference for length. The unit recommended was called the metre and was defined as 
one ten millionth the length of a straight line from the equator to the North Pole 
through Paris. The Academy prepared a ‘standard metre’- the distance between two 
lines engraved on a platinum-iridium bar. The original standard is kept in France; 
copies of the bar were made and distributed around the world. Later measurements of 
the earth, more accurate than those available to the French Academy of Sciences, 
revealed that the earth is somewhat larger than previously supposed. This is of no 
practical significance, since the standard bars were retained as the standard length for 
scientific measurements everywhere. 


It is now possible to measure lengths much more precisely using optical methods 
than was possible by measuring the distance between two lines engraved on a metal 
bar. In 1960, the International Bureau of Weights and Measures defined the meter as 
exactly 1 650 763.73 wavelengths of the orange light emitted by atoms of krypton gas. 
All other units also have external, physical objects or physical phenomena as their 
references. 


The names of the units and their relative magnitudes are also a matter of 
convenience and have developed through common practice in particular fields. Thus a 
‘light-year’ is the common unit of length used in astronomy where the distances 
between celestial objects are enormous. A light-year is the distance that light travels in 
one year. Light travels at a speed of 300 000 km in one second, therefore it travels 9 500 
000 000 000 km in one year. Measuring these vast distances in units of light-years is far 
more straight-forward than measuring the same distances in miles or kilometers. 


We also have BTUs, electron-volts, joules and calories - all used for energy; 
kilograms, pounds and ounces - used for mass and weight; seconds, hours, months and 
years - used for time. Consequently it is necessary to have conversion factors to allow 
observations or results expressed in one unit to be compared with measurements made 
using other units. 


A conversion factor is a number-with-units that is used to multiply a number 
with one type of unit (describing a physical measurement of some sort) in order to 
express that same physical measurement as a specific number of another type of unit. 


The physical distance described by the term “1 inch” is identical to the physical 
distance described by the term “2.54 centimetres”. While this seems trivial at this point, 
it is important to really think about our thinking, because there will be other instances 
where the same rigour in thinking is required to be applied to a situation that is not so 


trivial. If we are clear about our thinking processes, we will be able to handle situations 
we have never before encountered with confidence and be able to solve problems, and 
gain new understanding. 


Deriving a conversion factor: 
1 inch = 2.54 cm 


Therefore, we can write two conversion factors from this equality by writing two 
ratios: 


1) by taking the ratio of inches to centimetres, we can derive a conversion factor to use 
when taking a measurement expressed in centimetres and converting it to a 
measurement expressed in inches: 


(1 inch) / (2.54 cm) or (1/2.54) x (inch / cm) 
which gives us: 0.394 inch/cm 
and we say: 0.394 inches per centimeter. 


Thus each centimeter counts for 0.394 of an inch. 


If we read that a certain measurement is 25 cm, and we are not familiar with the metric 
system (we are used to dealing with inches), we can then convert cm to inches using: 


25 cm x 0.394 inch / cm = (25 x 0.394) inches = 9.84 inches 


2) by taking the ratio of centimeters to inches, we can derive a conversion factor to use 
when taking a measurement expressed in centimeters and converting it to a 
measurement expressed in inches: 


Note that (2.54 cm)/ (1 inch) 
which gives us: 2.54 cm/inch 
and we Say: 2.54 centimeters per inch. 


Thus each inch counts for 2.54 centimeters. 
If we read that a certain observed measurement is 27 inches, and we are only familiar 
with the metric system, then 


27 inches x 2.54 cm/inch = (27 x 2.54)cm = 68.6 cm 


As the understanding of our physical environment and the science of this 
physical environment progressed, it became evident all physical quantities which could 
be measured — volume, density, energy, acceleration, electrostatic attraction, 
gravitational force etc. — could be expressed in terms of a few fundamental entities such 
as mass, length and time. The fundamental entities are measured in so-called base 
units. These base units include: 
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mass is a fundamental physical entity measured in units of grams, kilograms, 
pounds (technically a ‘weight’), tons or even ‘stone’ (guess what the external 
physical reference was for that unit??) 


distance is a fundamental physical entity measured in the base units of inches, 
centimeters, miles etc 


time is a fundamental physical entity measured in the base units of seconds, 
hours, days, etc. 


speed is not a fundamental physical entity: it is defined as distance travelled 
divided by the time required to travel that distance. An instrument that 
measures speed is actually measuring two fundamental entities: distance and 
time. The instrument does the arithmetic and displays the calculated quantity of 
speed. 


area, volume, pressure, force, energy, and power are all examples of properties 
described by derived units. 


Energy is associated with moving an object of specific mass (mass is a 
fundamental entity) along a specific distance (distance is a fundamental physical entity) 
in a specific amount of time (time is a fundamental physical entity). Thus the amount of 
energy expended in going out for a run on a Saturday afternoon is based on the mass of 
the runner, the distance travelled and the time required to cover that distance. Energy 
is expressed in terms of measurements of these three fundamental physical entities. The 
runner has done some work and expended some energy. The energy expended in 
exercise is commonly expressed in units of Calories. It can also be expressed in units of 
Joules. Both the ‘Calorie’ and the ‘joule’ are examples of derived units. A derived unit 
is one that can be expressed in terms of so-called base units. Base units are the units of 
measurements that cannot be expressed in terms of other, more simple units; base units 
measure fundamental physical entities. But it would be very confusing indeed if we 
reported things like energy in terms of all the fundamental units involved. 


Example: 1 Calorie = 4,184 Joules 
Since a Joule is a derived unit, it can be written in terms of it base units: 
1 Joule = 1 kg m2/s2 


Imagine you have just consumed a 250 Calorie snack. You would never report to your 
friends that your snack was 1 046 000 kg m2/s2. It just doesn’t make sense in everyday 
life. (Unless your friends are all physics majors!) 
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The Calorie was first defined as the amount of energy required to heat 1 kg of 
water by 1 degree Celsius; it likely wasn’t obvious at the time that the unit Calorie is a 
derived unit. If 1 kg of water had its temperature raised by 1 degree, it absorbed 1 
Calorie of heat energy. So it is comfortable to talk about exchange of energy in terms of 
numbers of Calories or numbers of Joules. 


An erg is also a unit of energy. An electron-volt is a unit of energy. Even though 
you may have no idea whatsoever what these terms mean, you do now understand that 
they are units derived from base units and must somehow be related ultimately to 
mass, distance and/or time. (ergs and electron-volts are widely used by electrical 
engineers and physicists and the terms make sense in those contexts). 


All units systems have base units for the quantities of mass, length and time. All 
units systems have derived units for speed, acceleration, force, energy, pressure and the 
like. Where unit systems differ is in the external references (the units themselves) used 
for measuring and expressing the fundamental physical quantities of mass, length and 
time. All unit systems are based on measurements of naturally occurring physical 
phenomena. 


The International System of Units (S.I.) is the modern form of the metric system; 
it is devised around a set of specific base units and multiplication factors of 10. The 
base units used are the meter, kilogram and second (Table 1). (There are four other base 
units in this system used to measure the fundamental physical entities of electric 
current, thermodynamic temperature, luminosity and amount of substance. These will 
not be part of our course of discussion). The Imperial/US system of units is based on 
the units of foot, pound, and second. 


The metric or S.I. system has been almost universally adopted. Canada uses it 
for all legal purposes but we still use imperial units in many facets of everyday life. The 
United States has not adopted it, but U.S. scientists use it in their work and in their 
publications since it is the accepted system of units in the physical sciences. Engineers 
use yet another system of units. 


Table 1. Metric Conversion Table 


a f p n H m c d k M G T P E 
atto |femto| pico | nano | micro | milli | centi | deci | kilo | mega | giga | tera | peta | exa 
1048 | 1045 | 10-42 | 10° | 106 | 103 | 102 | 10+ | 103 | 106 | 10° | 101:2 | 1015 | 10:8 
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Examples: 1km=103m 1nm=10°m 1GJ=109J 
1km=1000m_ 1nm=0.000000001 m 1 GJ = 1 000 000 000 J 


In order to clearly evaluate, understand and compare observations and 
measurements made using different systems of units, conversion factors are used 
extensively to convert between the various systems (Table 2). When comparing the 
measurement of a fundamental physical property in one unit system to its measurement 
in another unit system, a conversion between these two base units is applied, as in the 
previous example. 


When comparing a physical property that is expressed in derived units in one 
unit system to the corresponding derived units in another system, ALL of the base units 
used in the derived unit must be converted in order to express the property in a new 
unit system. There are conversion factors to convert a derived unit in one system to a 
derived unit in another system; these conversion factors have taken into account the 
conversion of all the base units that are part of describing the property in the new 
derived unit. 


1.2. Metric and Imperial Units 


Even though the metric system is the official system of unit measurements in 
Canada (SI units) we still use the imperial system of units in some, if not most, 
circumstances. When buying gas at a gas station we buy gasoline in litres, not gallons. 
When driving on a highway the speed limit signs are posted in kilometers per hour 
(km/h), not in miles per hour (mph). Both of these examples are instances of SI units. 
However, if you want to build your deck in your backyard you better know the 
imperial units; the local stores tend to prefer deck dimensions in feet and inches. If you 
want to brag about your ability to bench press, you talk in pounds. When asked how 
tall you are, you respond in feet (ft) and inches (inch), unless it is a passport application 
and then you would record it in centimetres (cm). It is fair to say that living in Canada 
requires you knowing both units of measurements: the imperial system and the metric 
system. As a result we need to be comfortable with both of these units, understand 
them, and most importantly be able to do conversions between them. We want to 
practice the unit conversions that you might face in everyday life. 
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Table 2. Conversions between Metric Systems 


1 inch = 2.540 cm 1 watt = 1 joule per second =1J s 

1 ft = 0.3048 m 1 horsepower (hp) = 746 watts 

lyd =3 ft 1 kilowatt hour (kW h) = 3.6 x 106 joules 
1 mile = 1.6093 km 1kWh = 3412 BTU 

1 fluid ounce = 29.574 mL 1 Calorie = 4,184 joules 

1 mL = 1 cm? 1 Calorie = 1000 calories 

1 litre = 1000 mL 1 atmosphere = 14.7 pounds/inch 

1 litre = 0.2642 U.S. gallons 1 pound = the weight of 454 grams 

1 light year = 9.46 x 1015 m 1 joule = 1 x 107 ergs 

1 mile? = 2.589 km2 1 electron volt (eV) = 1.6 x 10 -!9 joules 


The Imperial system has a long tradition. Much of the imperial units stem from 
concrete measurements, for example the foot (unit of distance) or the horsepower (unit 
of power). Still this practicality is also its main problem: a lack of uniformity. For 
example, we have two kinds of gallons used in Canada, the US gallon and the imperial 
gallon. The metric system was created in order to have a common and uniform system 
of unit measurements. But, when it comes to an intuitive feel, few people might know 
or care that one meter is the distance traveled in one second by that orange light emitted 
by krypton atoms, divided by 299,792,458. 


The vast majority of countries, including Canada, have adopted the metric 
system. However, in countries that have a long tradition of imperial units this adoption 
is either slow (Canada) or non-existent (US). A carpenter in Calgary might have his or 
her building instructions in the metric system; however, when talking to his or her 
clients they will most likely speak in feet and inches. 


Depending on the situation at hand we use these two units of measurement. It is 
not a question of which one is correct; it is a question of which one is the most 
appropriate for the given situation. However, we definitely want to avoid accidents 
stemming from unit measurement conversions. The metric system is an international 
system of units and therefore it is used in the sciences and on the international stage. 
Having said that imperial system is used in the US and care has to be taken with 
conversion. 


Units of weight and mass get confused in the metric system. When asked how 
much they weigh, people tend to respond with kilograms, which is incorrect. A 
kilogram is a unit of mass and not weight. Mass is the measure of the amount of matter 
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in something. Weight is the force experienced by a certain mass. Pounds are the correct 
imperial units for weight. On earth, weight refers to the force of gravity acting on the 
mass of the object in question. Your mass would be the same on all planets, however 
your weight would change depending on the planetary gravitational acceleration 
(gravitational pull). 


1.3. Use of Energy 


We all have a need for energy. We need to heat our homes and we might talk in 
gigajoules (GJ) of energy. We use electricity and we measure the amount of kilo-Watt- 
hours used (kWh). When we BBQ or try to cool our homes in the summer, we talk in 
British Thermal Units (BTUs). We try to lose weight and talk about burning Calories. 
Quantities of energy are floating everywhere in both the metric system and the imperial 
system. 


In scientific research environments, energy is measured in Joules. In other 
situations, however, the energy is often reported in kilo-Watt-hours (kWh), for example 
on your electricity utility bill. One Watt is a quantity of power that measures the energy 
dissipated in a unit of time. Power, therefore, is a derived unit and is a measure of the 
amount of energy produced or consumed used per unit time. Recall that energy is also 
a derived unit! Specifically, one Watt is defined as one Joule per second., W = 1J/sec. 
Therefore, one kWh (kilowatt-hour) is the energy needed to sustain the power of 1,000 
Watts over the period of one hour: 


1 kWh = 1, 000 W h = (1, 000 J/sec) h x (3600 sec/h) = 3.6 x 106 Joules. 


In the imperial system the energy is measured in the British Thermal Units 
(BTU's). One BTU unit is the heat required to raise the temperature of one pound of 
water by 1 degree of Fahrenheit from 60 degrees Fahrenheit under the constant pressure 
of one atmosphere. Roughly speaking it is the heat contained in one wooden match. 
One BTU is approximately equal to 1055 Joules. When you buy an air conditioning unit 
in Canada the power reading tends to be in BTU's per hour (BTU/h) since the main 
market for air conditioning units is the US and they use imperial units. 


Air Conditioning: Suppose you purchase an air conditioning unit with the power 
reading of 5000 BTU/h. This is a small-sized air conditioning unit for a small bedroom. 
Given that your electricity price is 8 cents per kWh how expensive is it to run the unit 
continuously for the month of July? 
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Solution: We need to look up the fact that 1 kW roughly corresponds to 3,412 BTU/h. 
Therefore, the power for our unit is: 
We have 31 days in the month of July: 
(31 days) x (24 hours/day) = 744 hours 
Thus we use: 
(1.465 kW) x (744 hours) = 1089.96 kW h in July. 
The cost of running our unit continuously for the month of July is: 
(1089.96 kWh) x (0.08 dollars / kWh) = 87.20 dollars. 


Another important utility bill in Canada is the gas bill for heating your home. 
The gas that we burn for heating is usually natural gas. Natural gas is delivered to your 
home through a network of underground pipes. The piping comes out of the ground 
and connects to the gas meter. The meter measures the volume of gas by counting the 
number of times 100 cubic feet of gas (one CCF - the first C stands for 100, the next for 
the word cubic, and F for feet) passes through the meter. On your gas bill the energy 
used is typically reported in Giga Joules (GJ) since the unit of Joule is extremely small. 
On some bills the energy derived from the natural gas is reported in thousands of cubic 
feet of gas. We have 1,000 cubic feet of natural gas (one MCF) yielding about 1.05 GJ. 


Gas vs. Electricity: Based on your utility bill, compare the cost of energy derived from 
the natural gas and from electricity. 


Solution: We collect the following information on a certain 2010 utility bill. The cost of 
one GJ derived from the natural gas costs $3.062. On the other hand, the cost of one 
kWh derived from electricity is $0.05951. Now: 


1 kWh = 3.6 x 106 Joules = 0.0036 GJ, 
thus one kWh of energy derived from the natural gas costs: 
(0.0036 GJ)(3.062 dollars / GJ) = 0.0110232 dollars. 
Recall one kWh of energy derived from electricity costs $0.05951. Therefore, the 


ratio of the cost of energy derived from electricity to energy derived from natural gas is: 


(0.05951 dollars) : (0.0110232 dollars) 


The energy derived from electricity is 5.4 times more expensive than the energy 
derived from the natural gas. In the circumstances described above, it is more than five 
times as expensive to heat with electricity than with natural gas. 
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Cost of Heating vs. Air Conditioning: A certain January gas bill statement claims your 


gas furnace has consumed 12,000 cubic feet of natural gas. Estimate the average power 
associated with your gas furnace in BTU's per hour and compare this gas furnace 
average power with the 5,000 BTU/h air conditioning unit. 


Solution: We need to collect the following information: 1 cubic foot of natural gas yields 
about 1,020 BTU's. Also recall that 1 Watt is approximately 3.41 BTU/h. The power of 
the gas furnace is obtained by dividing the total BTU's used in the month of January 
divided by the number of hours in January: 


(12,000 CCE) x (1,020 BTU /CCF) = 16,452 BTU/h, 
(31 days) x (24 h/day) 


thus the power of the gas furnace is more than three times higher than the 5000 BTU/h 
air conditioning unit. 


The overuse of the non-renewable energy sources is becoming an issue primarily 
for two reasons. First of all the non-renewable resources such as oil or coal are resources 
that once used up will not get replaced. Second, the use of these non-renewable carbon- 
based resources leads to the possibility of global warming of our planet by increasing 
the carbon dioxide concentration in the atmosphere. One of the purported global 
warming culprits is the automobile. Recently, fuel consumption of our vehicles became 
a bigger issue not only from the financial standpoint (higher oil prices) but also from the 
ecological standpoint. 


When we talk of car mileage, we typically talk in miles per gallon, mpg. In fact 
we have two types of gallons, the US gallon and the Imperial gallon. The US gallon is 
roughly 3.79 liters and the imperial one is 4.55 liters. However, Canada being a SI units 
country, we do now and then hear the measurement of how many liters of oil is 
consumed per 100 km. 


Fuel Consumption: The fuel consumption of my favourite car, the Dodge Charger 2.7 
V6, is rated at 9.7 liters per 100 km highway driving. Convert this fuel consumption to 
the ever popular mileage of miles per gallon (mpg). Here, we mean the US gallon. 


Solution: One mile is approximately 1.61 km. If we measure the mileage in US gallons 


we need : 
(9.7 litres) /(3.79 litres/US gallon) = 2.56 US gallons to travel 100 km 
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and (100 km)/(1.61 km/mile) = 62.11 miles 


This yields the mileage of 62.11 miles/2.56 US gallons = 24.3 mpg (US). 


Fuel comparison: A friend told you that her Ford Focus got 44 miles per gallon on the 
highway. Can you translate to litres per 100 km and comment whether this is most 
likely an imperial gallon or the US gallon? 


Conserving a Non-renewable Resource: The minimum mileage regulation for cars on a 
highway is set at 27.5 mpg (US gallon). This mileage is achieved at the ideal speed of 55 
mph (miles per hour). Of course, in reality very few cars go at 55 mph. For every 
increase of 1 mph in speed of a car (above 55 mph) there is a 0.5 mpg decrease in the 
mileage. We collected the following stats for cars traveling on highways: 


12.5 % travel at 70 mph 
50 % travel at 75 mph 
25 % travel at 80 mph 
12.5 % travel at 85 mph 
Which scenario would yield better fuel efficiency? 


Scenario I: Increase the ideal minimum (highway) fuel efficiency for cars from 27.5 mpg 
to 30.2 mpg (2010 US proposal) 


OR 


Scenario II: Somehow convince people to drive at speeds less than or equal to 70 mph. 


When shopping for vehicles some people care about the horsepower reading at 
certain rpms (rotations per minute). One horsepower is an imperial unit of power, 
traditionally used in the automotive industry. This unit tried to capture the actual 
power expended by a horse. Since few people these days can relate to this unit, I 
wonder how intuitive this actually is to city folk. Maybe, the power of a car should be 
measured in Calories per hour, as these days many people tend to exercise in the gym. 
One food Calorie (capital 'C') is equal to 1,000 calories (lowercase 'c'), which in turn 
equals to 4,184 Joules. 


Converting Horsepower: A certain sports car has the horsepower (hp) reading of 600 hp 
at 6,100 rpm. Convert to Calories per hour and compare with the average energy 
expended on a treadmill in one hour. 
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Solution: We find that one horsepower is about 745.70 Watts which is 745.70 J/s which in 
turn is 
(746 J/s) x (3,600 sec/h)/ (4,184 J / Calorie) = 642 Calories/h for 1 hp 


Thus: 
600 hp corresponds to (600 hp) x (642 Calories /h hp) = 385,200 Calories /h. 


Almost one thousand times the typical average energy burned on a treadmill in a one- 
hour workout! 


The production of oil on the international stage is traditionally measured in 
barrels, namely because the current world currency is the US dollar and US has the 
imperial system of units. Even though there are many types of barrels, the oil barrel 
consists of 42 US gallons. 


Oil Spill: A very large oil spill occurred in the Gulf of Mexico in the spring of 2010. The 
oil continued to escape the well and pollute the ocean for months. Some estimates were 
putting the rate of spilling at 60,000 barrels a day. Convert this amount to liters per 
month. 


Solution: 
Daily spill rate in litres is: 

(60,000 barrels / day) x (42 gallons) x (3.79 litres / gallon) = 9,550,800 litres 
which in turn yields: 

(9,550,800 litres / day) x (30 days / month) = 286,524,000 litres per month. 


1.4. Units in Everyday Life 


One of the units for pressure, which is force per unit area, is the atmosphere 
(atm). One atm reflects the typical air pressure at sea level and it equals 101,325 Pascals. 
A unit of atmosphere, a unit of Pascal are derived units. In practical measurements, 
such as the air pressure in a car tire, one records the gauge pressure, which is the 
pressure in the tire over and above the atmospheric pressure. Therefore, the gauge 
pressure of 220 kPa, which is 32 psi, requires the pressure to be 32 psi above 
atmospheric pressure. 
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Gardening: Your house yard is 5 meters wide and 12 meters long. You want to order 
topsoil for your yard and you want it to be 6 cm high. The company that you hired 
wants the amount of soil in cubic feet. How many cubic feet of soil do you need? 


Solution: The volume of the topsoil should be: 
(5m)(12m)(0.06m) = 3.6 m3. 
Now 1 meter is about 3.28 feet, therefore 1 m3 corresponds to: 
(3.28 ft) = (3.28) ft 3= 35.29 ft3. 
Thus we need: 
(3.6 m3)(35.29 ft3/m3) = 127.04 ft? of topsoil for the yard. 


In some professions, it is acceptable to trace the actual hours worked. On the 
other hand, in some professions, it is universally accepted that the wages earned should 
not be calculated on the per hour basis as there is a lot of work performed outside the 
actual hours "worked". For example, it is rare to ask what is the hourly wage of an 
average hockey player in the NHL. Some people think that teachers work only when 
they are in the classroom but tasks such as marking, course preparations and 
administrative duties are many times overlooked, and calculating wages based on 
hourly work is not appropriate. 


Hourly Wages: Let's for fun calculate the hourly wage of an average NHL hockey 
player. 


Solution: In the 2005-2006 season the average NHL salary was 1.46 million. Suppose the 
team that our player was on did not make it to the playoffs and the player played only 
82 games in the season (no misses, no injuries). An average player is about 15 minutes 
on the ice (on average) during any game. That makes: 

(82 games/yr) x (15 mins/ game) = 1230 mins/yr, 


which is about 20.5 hours per year. If we consider the player working only during his 
ice time and he makes $1,460,000 per year then he makes about: 
($1,460,000 /year) x (1 year/20.5 hours of game time) = $71,220/h of game time, 


which is almost $20 a second, $1,187 per minute, $593.5 per 30 seconds (the average ice 
time during a shift). Also, an average NHL player makes about $17,805 per game. That 
is not bad. But we all know if they showed up only during the ice time with no 
preparation they would be in deep, deep trouble! 
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Chapter 2. Conventional Fossil-Fuel Energy vs. Nuclear 
Energy 


What is nuclear energy? It is the net energy released in a nuclear reaction. And 
what is a nuclear reaction?? In order to answer these questions, we need to understand 
something about the nature of matter and the conventional energy liberated by many 
conventional chemical reactions, including the chemical reactions of combustion — 
which are the reactions of hydrocarbons with oxygen in the atmosphere to produce 
carbon dioxide, water and heat. The heat is used to heat our living and working spaces. 
The heat energy is put to great use in electricity generation. The heat energy of 
combustion is also put to effective use in expanding the volume of the gases produced 
in your car engine’s cylinders; the expansion of the gasses push the pistons, which are 
attached mechanically to a system that takes this motion and translates it into the 
motion of driving the wheels. 


There are countless ways we make use of the net energy liberated in the chemical 
reactions of combustion. This energy is measured in joules, in BTU’s, in kWh’s, in 
Calories, in ergs, in eV’s — depending on the situation where the energy is used. Each 
application, each use seems to have its own special language, its own set of units. All of 
these seemingly different terms are all referring to the very same thing: energy. In 
previous sections of this text, you have gained some understanding of the nature of unit 
systems and the conversions between different units, and this knowledge will be 
important throughout this text. 


2.1. Conventional Fossil-Fuel Energy 


Matter is made up of atoms and molecules; molecules are collections of atoms 
bound together in certain specific combinations. In order to break apart a molecule into 
its individual constituent atoms, energy must be added to “the system”. Thus, when 
the reverse occurs and a collection of individual atoms are joined together to form a 
molecule, energy is then released. 


When we burn a hydrocarbon in the oxygen of the atmosphere (a combustion 
reaction) to produce carbon dioxide and water, we are in effect (not in reality! but that 
does not matter here for the math) taking the required number of molecules of our 
hydrocarbon, putting in energy and pulling apart the molecules into individual carbon 
and hydrogen atoms; we are also taking the required number of oxygen molecules, 
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putting in energy and pulling them all apart into individual oxygen atoms. Now, with 
this collection of individual carbon, hydrogen and oxygen atoms, we don’t add 
anything else, we don’t remove anything — we simply rearrange the atoms and join 
them together in a new fashion to make carbon dioxide molecules and water molecules. 


Given the number of atoms we started with, we make a certain specific number 
of these product molecules. When atoms are joined together to make molecules, energy 
is liberated. The question is: how much energy was required to break apart the reactant 
molecules compared to the amount of energy liberated when the product molecules are 
formed? In combustion reactions, the amount of energy input is far less than the 
amount of energy liberated and thus combustion reactions are net energy producers. We 
capture this liberated energy and put it to use to do work for us. 


We represent chemical reactions by reaction equations, where the number of each 
type of atom is the same on both sides of the arrow. Thus, we have simply rearranged a 
collection of atoms and bonded them together in new ways in the reaction. For example, 
natural gas (methane, CH4) is burned with the oxygen molecules in the air to produce 
carbon dioxide and water (Figure 1). The number of carbon atoms before and after the 
reaction is the same; the number of oxygen and hydrogen atoms is also the same before 
and after. The reaction equation is ‘balanced’ and accounts for all of the material in 
terms of numbers of atoms present. 


CH; + 202 —> CO: + 2 HO 


Figure 1. The chemical reaction of burning methane in an oxygen-rich environment 


When we combust hydrocarbons we are essentially liberating the chemical 
energy stored in the bonds of the molecules. How much energy are we talking about? 
The above example, combustion of CH4 shows what happens when we burn natural gas 
in our furnaces or hot-water-heaters. Natural gas is sold to us in units of cubic feet, ft°. 


Energy from Natural Gas: In a previous example in this text, a natural gas bill for one 
cold January in Calgary indicated that 12,000 ft? had been sold to the homeowner. Let’s 
assume that all of that natural gas was used to heat the home using a gas furnace. Can 
we tell how much energy was required to heat the home? 
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Solution: Previously we were given the information that 1 ft? of natural gas releases 
approximately 1020 BTUs of energy when burned. Thus, we can write a mathematical 
equality and the associated conversion factors: 


1 ft? = 1020 BTU 0.00098 ft? / BTU or 1020 BTU /ft3 
And we can calculate for the month in question: 
(12,000 ft) x (1020 BTU/ ft?) = 12,240,000 BTU were used. 
Further: 
(12,240,000 BTU/31 days) x (1day /24 h) x (1055 joules /BTU) = 


17.36 x 106 joules /h = 17.36 MJ per hour 


All of this energy comes to us as the result of the combustion of approximately 
255 kg of methane during that January month. The energy has not been created from 
nothing; it is the energy stored in the chemical bonds that has been liberated. 


The burning of other hydrocarbon fuels such as octane (CgHis) and cetane 
(C16H34) in gasoline and diesel, and other hydrocarbons in jet fuel, results in the same 
thing: chemical energy stored in the bonds of the molecules being liberated. Natural 
gas is also burned to provide heat to generate steam and the steam produced is used to 
drive turbines that generate electricity. Coal contains mixtures of various hydrocarbons 
and is burned to liberate energy that can be used to heat homes but is most often used 
in the generation of electricity. 


2.2. Structure of the Atom and Nuclear Energy 


Fossil fuels are composed of hydrocarbon molecules, that is molecules consisting 
primarily or exclusively of carbon and hydrogen atoms. We need not consider the 
composition of matter on any scale smaller than that of molecules and atoms in order to 
be able to understand the liberation of energy in the combustion of fossil fuels, the so- 
called ‘conventional energy sources’. In order to understand nuclear energy, we must 
consider the nature of matter on a sub-atomic scale; we must have some knowledge of 
that which atoms themselves are made. 


All atoms have a nucleus surrounded by electrons (electrons are particles 
carrying a -1 charge). The nucleus is a collection of protons (particles carrying a +1 


charge) and neutrons (particles carrying no charge). Ina 
neutral atom, the number of electrons is equal to the number of 
protons. The number of neutrons can vary considerably, 
depending on which type of atom we are talking about. A 
hydrogen atom has only one proton and one electron, and is 
made of no other particles. A helium atom has two protons, 
two neutrons and two electrons (Figure 2). A carbon atom has 6 
protons and 6 neutrons in its nucleus. An iron atom has 26 


protons and 30 neutrons. A uranium atom has 92 protons and Figure 2. An atom of 
146 neutrons. The number of protons determines which helium has two neutrons 
l ‘ihe at ecah t l h 6 t (green) and two protons 
elemen e atom 1s; Carbon atoms always Nave o protons, (red) in its nucleus with 
uranium atoms always have 92 protons. The number of two electrons (yellow) 


neutrons can vary somewhat; carbon can have 6 or 7 oreven8 in its orbit. 

neutrons. Each of these three situations results in a slightly 

different type of carbon atom. Uranium atoms can have 146 neutrons or 143 neutrons. 
Two atoms with the same number of protons and different numbers of neutrons are 
called isotopes. They almost always behave identically in chemical reactions (with 
some exceptional circumstances, beyond the scope of this text, which can be exploited 
in scientific research). The important difference between isotopes of the same element is 
in their behaviour when it comes to nuclear reactions. 


In chemical reactions, such as combustion, molecules are converted into other 
molecules by the rearrangement of atoms and the rearrangement of the connectivity of 
the various atoms. Bonds between atoms can be thought of as a mutual sharing of each 
other’s electrons. The distance between two atoms in a molecule is really the distance 
between those two atom’s nuclei. This distance between two nuclei is the shared space 
where the electrons from one atom share the space with the electrons from the 
neighbouring atom and in effect, both atoms share ‘ownership’ of the total number of 
electrons in the two atoms. (This is a simplification, but an extremely useful one for 
many purposes.) Atoms dissociate from and associate with other atoms by changing 
the shared space and the shared electrons. In this model, the nuclei do nothing but 
move around into new positions so the new bonds can be formed to make new 
molecules. 


Let’s examine an atom of helium (Figure 2). In the centre is the nucleus. The 
relative sizes of the inner and outer spheres is not to scale in the figure; in reality the 
diameter of an atom, including its sphere of electrons, ranges from 62 pm to 520 pm 
while the diameter of a proton is just 1.75 fm. If we consider the smallest of atoms, the 
hydrogen (H) atom, containing 1 electron and 1 proton only, the: 


diameter of the H atom is 62 pm = 62x 10-12 m 
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diameter of the H nucleus is 1.75 fm = 1.75 x 10-5 m 


We can see that the H atom is approximately 3,500 times bigger than the size of 
its central nucleus. The nucleus represents a miniscule volume within any given atom. 


The same thing cannot be said for the mass of the nucleus compared to the mass 
of the entire atom. Protons and neutrons have the same mass, approximately 1.67 x 10-27 
kg. On the other hand, the electron has a mass of approximately 9.1 x 103! kg. The 
nucleus of an atom is on the order of several thousand times more massive than the 
electrons surrounding it, depending on how many protons and neutrons there are in 
total in the nucleus. For most purposes, the nucleus accounts for virtually 100% of the 
mass of the atom. 


In contrast to chemical reactions, where one type of molecule is converted into 
other types of molecules by rearrangement of atoms, nuclear reactions occur when 
nuclei are converted into other types of nuclei by rearrangement of the nucleons. 
“Nucleons” are the particles in the nucleus - the protons and neutrons. Just as with 
chemical reactions, nuclear reactions can be expressed symbolically and we can balance 
the number of nucleons on each side of the nuclear equation. To represent a nuclear 
reaction, we use atomic symbols (to indicate the element type) and we use subscripts 
and superscripts to indicate the number of positive charges (the number of protons = 
the number of positive charges) and the total number of nucleons. The difference 
between the total number of nucleons and the number of protons gives us the number 
of neutrons. If we see a change in the number of protons in a nuclear reaction, it means 
we have seen the conversion of one type of element into another type of element! A 
nuclear reaction changes the constitution of the nuclei involved and so does indeed 
convert an atom of something into an atom of something else. A nuclear reaction equation 
keeps account the number of nucleons, not the number of atoms (as does a chemical 
reaction equation). It can be seen that the number of each type of atom does not 
balance in a nuclear reaction, but the number of nucleons does balance. That is one 
difference between nuclear reactions and conventional chemical reactions. 


In comparing the masses of various atomic nuclei with the total mass of their 
constituent nucleons, we find a very slight discrepancy in the numbers! Remember — 
mass is a measure of the amount of matter in something. If we consider a collection of 
individual particles and we have 8 protons, 8 neutrons and 8 electrons — all individually 
isolated from one another and not interacting in any way whatsoever, the total mass of 
all of these very slightly higher than the total mass of an oxygen atom. An oxygen atom 
is simply comprised of 8 protons, 8 neutrons and 8 electrons, so we would expect it to 
have than same mass as the total mass of its constituent particles. The fact that is does 


not is referred to as the mass defect. It is important to know that this defect occurs for 
all types of atoms on the periodic table. The extent of the mass defect differs from 
element to element, but is always there. 


The mass defect is also sometimes referred to as an energy defect, since the 
amount of missing mass can be converted to an amount of energy according to the 
famous equation: 

E=mc2, where: 
E = energy in joules 
m = mass in kg 
c = speed of light = 2.997 x 108 m s4 


A Challenge for You. Can you show that the units on both sides of the equation are 
equivalent? In other words, can you convert joules to kg m? s 2? 


The fact that an atom has less mass than its constituent particles, and there is a 
mass-energy equivalence according to E = mæ, means that when a whole collection of 
individual sub-atomic particles is brought together and they “bond” to make an atom, 
energy is released. This is analogous to a whole collection of individual isolated atoms 
being brought together to “bond” and form a molecule thereby releasing energy. 


Recall then, that breaking apart a molecule into individual atoms requires energy 
to be put into the system, it should come as no surprise that breaking apart an atomic 
nucleus into individual nucleons also requires the input of energy. Also recall that the 
energy liberated in a conventional fossil-fuel combustion reaction can be calculated as 
the difference between the amount of energy required to completely break apart the fuel 
molecules and the oxygen molecules and the amount of energy liberated by gathering 
the various atoms together to make new molecules of CO2 and H2O. Therefore, we may 
quite logically reach the conclusion that the amount of energy liberated in a nuclear 
reaction is simply the difference between the amount of energy required to break apart 
the fuel nucleus into its individual constituent protons and neutrons and the amount of 
energy liberated when this collection of particles rearranges itself into new nuclei. 


This is exactly the way we think of the energy liberated from the fission of a 
uranium-235 nucleus in a nuclear power reactor. When a collection of U-235 nuclei is 
bombarded with fast-moving neutrons, the incoming neutrons plus the U-235 nuclei 
react to form U-236. This U-236 nucleus is very unstable and immediately falls apart to 
for two new nuclei, Kr-92 and Ba-141, and releasing 3 fast-moving neutrons. Each of 
these three fast-moving neutrons can collide with another U-235 nucleus in the fuel and 
the same reaction will occur. Notice that the number of nucleons balances in this 
reaction: 
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1 neutron + 235 (neutrons + protons) in U-235 = 236 (neutrons + protons) in U-236 


236 (neutrons + protons) = 3 neutrons plus 92 (neutrons + protons) in Kr-92 
plus 141 (neutrons + protons) in Ba-141 


A great deal of energy is also liberated in this reaction, and the energy liberated 
comes from the energy stored in the nuclear forces in the atomic nuclei. Notice as well, 
we have balanced the number of nucleons, not the total mass of the nucleons. 


It is also worth noting that every time one U-235 nucleus undergoes a reaction 
like this, it releases 3 more neutrons. Each of these can initiate another reaction, each 
reaction producing another 3 neutrons and so on. Thus, this type of nuclear reaction is 
called a chain reaction and is self-sustaining. If the concentration of U-235 in the fuel is 
high enough and the neutrons produced are not captured deliberately by other 
technological means, the amount of energy released per unit time grows exponentially 
and we are faced with a run-away reaction, an atomic bomb. 


If, however, the concentration of U-235 in the fuel is lower and some of the 
neutrons are captured using so-called ‘control’ rods in a reactor, the amount of energy 
liberated per unit time can be kept constant and manageable and it can be used to 
generate electricity in a nuclear power plant. 


In nature, uranium ore contains approximately 0.72% U-235. A half-life is the 
amount of time it takes for half of this material to decay; therefore half of the original 
amount remains. U-235 has a half-life of 700 million years and left to its own devices, 
U-235 decays (converts via a nuclear reaction) to thorium, which it itself is unstable and 
will decay further. If the naturally occurring ore has its U-235 content increased 
through enrichment, it can be used in a controlled chain reaction and the energy 
released can be collected and put to use. If the ore is further enriched in U-235, it can be 
used in a nuclear bomb, where the chain-reaction is uncontrolled and results in an 
explosive release of energy. 


How much energy is released in a nuclear reaction? When nuclear physicists 
talk about the quantities of energy involved in various nuclear reactions and when they 
talk about the energy stored in a particular nucleus (the so-called nuclear binding 
energy) they typically report these energies in units of million electron-volts, MeV’s. 
Unless we ourselves are practicing physicists, these numbers will have little practical 
significance to us. We can, however, compare the amount of energy released in a 
nuclear reaction to the amount of energy produced or consumed in more familiar 
processes. 


The energy equivalence of 1 g of matter (according to E = mc?) is: 9 x 1013 J. 
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This is the same amount of energy as: 


the energy released in the nuclear fission of 1 kg of U-235 

the burning of 7 000 tons of coal (14 million pounds, 6.4 million kg) 

° the exploding of 20 000 tons of TNT (40 million pounds, 18 million kg) 

the total solar energy falling on 2 square miles of the earth’s surface / day 

This should put things in perspective! Comparing the relative energy of various 
technologies and energy sources and keeping in mind the efficiencies of different 
technologies is vitally important to making informed choices about the production, use 
and regulation of our energy industries. Keep in mind that there is more to consider 
than just energy production. The waste products of both conventional and nuclear 
energy must also be taken into account. The waste products of nuclear energy are 
different from the waste products of conventional fossil fuels and both pose 
significantly different challenges in their management. 


2.3. Radioactivity and Naturally Occurring Nuclear Reactions 


There are countless naturally-occurring nuclear reactions. The energy emitted 
from the sun is the result of nuclear fusion reactions, where hydrogen nuclei fuse 
together to produce helium nuclei. The background radiation levels found in nature 
here on earth are the result of nuclear reactions, specifically the decay of the nuclei of 
certain heavy nuclei into other, lighter nuclei. Along with the new nuclei being 
produced, we have learned that energy is also liberated. This energy can be in the form 
of the kinetic energy of fast-moving particles such as electrons, neutrons, protons or of 
very small helium nuclei, or it can be in the form of electromagnetic radiation. We will 
learn a little about the various forms of radiation and their potential hazards, but first — 
where does radioactive material come from? 


If being radioactive means something is decaying by a nuclear process, then why 
is there any of it present at all anymore and why is the amount of certain radioactive 
nuclei constant in nature? As an example, we can consider the radioactive carbon-14 
nucleus, C-14. This particular isotope of carbon is being formed in the upper 
atmosphere at a fairly constant rate, so any that decays is being replenished. Consider 
the following reaction: 
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In the diagram, above, the nuclear reaction between a nitrogen nucleus and a 
neutron is shown resulting in the formation of a carbon nucleus and a proton. Notice 
that the total number of nucleons is the same on both sides of the equation. The 
superscript represents the total number of nucleons; the subscript represents the 
number of protons in the nucleus. The number of protons defines the type of atom, the 
element. 


Neutrons are constantly coming into the earth’s atmosphere as part of the solar 
radiation impinging on the earth. Solar radiation is the radiation that results from the 
nuclear fusion reactions taking place in the sun. Sunlight that reaches our earth is 
another type of radiation (electromagnetic) that results from these solar fusion reactions. 


In the upper atmosphere, these in-coming neutrons strike the nuclei of nitrogen 
atoms. N2 molecules make up approximately 77% of our atmosphere. Depending on 
the rate of collisions between neutrons and nitrogen nuclei, and depending on the force 
with which they encounter each other, a certain number of nitrogen nuclei will absorb a 
neutron and then immediately ‘kick out’ a proton. Thus, the result is a nucleus with 
one less proton and one more neutron than the original nitrogen nucleus. Since we now 
have a nucleus with fewer protons, we can see that a new atom has been made - an 
atom of carbon. The proton kicked out will go on to some other fate in the atmosphere. 
The result is the manufacture of a particular isotope of carbon, a so-called C-14 atom. 
C-14 is an unstable, radioactive isotope of carbon. It undergoes radioactive decay and 
liberates some energy in this process. 
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Chapter 3. Understanding Radioactivity and Its Uses 


The discovery of radioactivity was a purely accidental discovery in physics. 
French physicist Henri Becquerel was interested in the ability of certain substances to 
transform the ultraviolet light falling on them into visible light - a phenomenon called 
‘fluorescence’. Becquerel kept a collection of minerals used for his studies in a desk 
drawer along with a set of un-exposed photographic plates. One day, he returned to the 
drawer to use the plates for taking photographs. When he developed the plates, he 
discovered they were terribly blurred as if previously exposed to light. He checked on 
the remaining, unused plates and discovered that although they were wrapped in thick 
black paper and kept in sealed boxes, they too seemed as if they had been exposed to 
light. But how? Becquerel was able to deduce that the presence of a piece or uranium 
ore, a pitchblende, in the same drawer as the sealed photographic plates, was enough 
blur the photographic plates through their protective layers. 


The existence of penetrating radiation that could pass through opaque materials 
as if it were passing through glass was a recognized fact in the late 1800’s. Wilhelm 
Roentgen had accidentally discovered X-ray radiation, which could pass right through 
cardboard, black paper and even the human body. While X-rays could only be 
produced by special equipment firing high-speed electrons at metallic targets, the 
radiation Becquerel discovered was being steadily emitted from a piece or uranium ore 
without any external provocation whatsoever. 


Early studies of this spontaneously emitted radiation showed that it was 
unaffected by physical and chemical conditions. It matters not whether we have a 
sample of pure, metallic uranium or chemical compound containing uranium, the 
radiation is emitted at a rate that is proportional to the amount of uranium in the 
sample. It was also discovered that many of the other heavier elements on the periodic 
table also spontaneously emit radiation according to the very same principles. 
Radiation, then, is something being emitted during the decay of an unstable nucleus. 


3.1 Radioactive Decay 


The process of natural radioactive decay is attributed to the intrinsic instability of 
the nuclei of certain elements, which results in the spontaneous, violent break-up of the 
nucleus into smaller fragments and includes ejection of energetic alpha and beta 
particles and high-energy electromagnetic radiation. Thus, alpha (a), beta ($) and 
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gamma (y) radiation are all products of various nuclear reactions — including 
radioactive decay — and they are all means by which nuclear energy is released in a 
nuclear reaction. The radiation carrying this energy can be damaging to living 
creatures. The nature and extent of the damage done depends on the type of radiation 
and the amount absorbed per unit time and on the total amount absorbed over time. 


radiation is a stream of high-energy a particles; an a particle is 
a helium nucleus consisting of 2 protons and 2 neutrons 


Bradiation is a stream of ß particles; a $ particle (sometimes written (-) is an 
electron 


y radiation is high-energy electromagnetic radiation and is not a stream of 
particles, as are a and ß radiation 


C-14 is one of the inherently unstable nuclei in nature; the ratio of protons to 
neutrons, given the total number of nucleons and the total mass of this nucleus means 
that it will spontaneously fall apart to produce something more stable. The diagram 
below shows the nuclear reaction that happens when C-14 decays (converts 
spontaneously) back into N-14 by kicking out an electron from the nucleus. An electron 
in the nucleus??? Indeed, an electron can be ejected by the nucleus — a neutron 
spontaneously ejects an electron and in so doing, the neutron is converted into a proton! 
Electrons do not exist in the nucleus, but they can be manufactured there. Thus, C-14 
decays by emitting B radiation; C-14 is said to be a B emitter. 


In the equation in the drawing below, the C-14 nucleus has 6 protons and 8 
neutrons; an electron is kicked out converting a neutron to a proton giving an N-14 
nucleus with 7 protons and 7 neutrons. One neutron of the original 8 has been 
converted to a proton. Changing the number of protons changes the identity of the 
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element. Thus, the carbon no longer exists; it has been converted to nitrogen. Here, the 
electron produced goes on to other processes in the atmosphere. 


The C-14 produced spreads evenly through the atmosphere and is incorporated into 
atmospheric CO2. Carbon dioxide dissolves in the oceans. Plants take up carbon 
dioxide via photosynthesis and incorporate the carbon atoms into their molecular 
structure. Animals ingest plants, and animals also eat other animals. Thus, every living 
thing is constantly exchanging carbon (some of which is C-14) with its environment as 
long as it is alive. The rate at which C-14 is produced in the atmosphere and 
incorporated into living things and the rate at which C-14 decays results in a stable 
natural abundance of C-14 in living things of 1 part per trillion. This means just 
0.0000000001% of all carbon atoms are C-14. 


As soon as a living thing ceases to live and therefore ceases to exchange carbon 
(including C-14) with its environment, the C-14 in its structure that naturally decays is 
no longer being replaced by C-14 being added from the environment. As soon as 
something dies, the level of C-14, which was constant at 1 part per trillion during life, 
begins declining steadily. The rate of decline in C-14 concentration is directly related to 
the relative instability of the C-14 nucleus itself. 


How unstable is the C-14 nucleus? The instability of one individual nucleus can 
be thought of as the probability that that nucleus will disintegrate in a specified period 
of time. A more stable nucleus would have a lower probability of disintegrating in a 
particular amount of time than a less stable nucleus would in that same time period. 


Since we are never considering single atoms or single nuclei when measuring 
things in our environment, it is much more convenient to relate the stability of a 
particular nucleus to its half-life. Remember that a half-life is the amount of time it 
takes for half of the nuclei to decay; therefore half of the original amount remains. At 
the end of two half-lives, one quarter the original amount (parent isotope) remains and 
the rest is the new (daughter) product; at the end of three half-lives, one eighth will 
remain, etc. This follows the mathematical principle of exponential decay, not linear 
decay since we are examining the amount of decay by the same relative amount in a unit 
of time (Figure 3). Linear decay would be when the nuclei would decay by the same 
absolute amount in a unit of time. 
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There is a simple way 
to mathematically describe 1p0 
the amount of material left 
after any number of half- 
lives. If we start with an 
amount Qe after n half-lives 
there will be an amount Q 
left according to: 
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Snes 50 atoms of parent 
22322333350 atoms of daughter 


eol product 


25 atoms of parent 
75 atoms of daughter 
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13 atoms 
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i; 94 atoms 
gas, for example, has a half- le of daughter 
30 je product 


life of 3.8 days. If we start 
with 5 mg of radon, how 
much will be left after a 


Percentage of radioactive isotope remaining 
oa 
oO 


month (30 days)? 10 
Solution: One month 00 1 2 3 4 
represents 30 days /3.8 days / Number of half-lives 
half-life = 8 half-lives. Thus: ©2011 Pearson Eason 1 
Q=Qx Figure 3. The radioactive decay of material is exponential. The 
1/2)" parent isotope will decay by 50 percent for each half life but the 
( ) amount of material will gradually decrease with each half life. 
Q=5 mg 
x (1/2)8 


Q=5 mg x (1/256) 
Q = 0.02 mg remaining. 


The shorter the half-life, the more unstable the nucleus and the higher the 
probability it will decay in a given time period. The half-life of C-14 is 5730 years. The 
rate of decay for an unstable nucleus is 50% per half-life. The shorter the half-life, the 
faster the rate of decay, the more unstable the nucleus. 


The rate-of-decay is another measure of the stability of a nucleus. Since we are 
dealing with a certain probability of decay happening, and the probability of decay is 
constant for a given nucleus type, the rate of decay (amount lost per unit time) must be 
represented by the loss of a constant percentage per unit time, where the percentage is 
the based on the amount of material present at the beginning of a particular time 
period. The amount present is constantly changing, but the percentage decaying stays 
the same no matter how much is present. We do not observe the loss of a constant 
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number of atoms per unit time. The number of radioactive atoms that decay per unit 
time is proportional to the number of atoms available and is entirely independent of the 
age of the atoms. 


Mathematically we say that the amount of C-14 present in non-living (previously 
alive) material decays exponentially, not in a linear fashion. The mathematical 
equations that are used to describe exponential decay also allow us to calculate the age 
of materials by measuring the amount of C-14 remaining — knowing that at the time of 
death of the organism, there was a specific ratio of C-12 to C-14 and sometime later, that 
ratio has changed and is measurable. Knowing these two values for the amount of C-14 
present, we can calculate the amount of time elapsed between death and present-day. 


3.2. Radiometric Dating 


These mathematical expressions can be used to take existing information — 
typically the amount of material present currently and the amount of material known to 
be present originally, and the half-life of that material — and solve for some information 
of interest — typically the amount of time taken to see the material decline from its 
original amount to the amount now measured. Depending on the information given, 
we can calculate what the half-life of a material is, we can calculate the rate of decline of 
the material or we can solve what the initial amount must have been. 


There are other nuclei that are unstable and can therefore be used to date various 
materials and artifacts. The half-life of a given nucleus and the known chemistry of 
particular elements makes then appropriate for dating items of very different ages. For 
instance, C-14 with a half-life of 5730 years is useful for dating materials containing 
biological carbon compounds that are on the order of tens of thousands of years old. 
Tritium, which is a radioactive isotope of hydrogen, has a half-life of 12.3 years and is 
useful in dating materials on the order of tens to hundreds of years old — when those 
materials include hydrogen atoms that have been in exchange with the environment at 
some point in time. 


While tritium dating can be used for biological materials with hydrocarbons in 
them, but it can also be used for dating water samples. Tritium is also produced in the 
atmosphere by the action of cosmic radiation and it is carried to the surface of the earth 
in precipitation. Tritium dating has been used to monitor the movement of masses of 
water, both in ocean currents and in sub-surface water reservoirs by taking samples 
from different depths. The tritium content tells how long it has been since this water 
came down in the form of rain or snow. 
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Geologists can tell us how long ago a particular era was by studying the 
thicknesses of various prehistoric deposits and comparing them with the rates of 
formation of sedimentary layers, but this method of dating geological material is rather 
inexact compared to radiometric dating methods. Neither C-14 nor tritium can be used 
to date rocks; U-238 is an abundant isotope of uranium and will, after undergoing a 
series of decay reactions, produce a stable (non-radioactive) isotope of lead, Pb-206. The 
half-life of U-238 is about 4.5 billion years. A piece of rock formed when molten 
material was pushed up through the earth’s crust and solidified and has remained 
unchanged ever since, can, if it contains some U-238 (which it most certainly will 
contain) have its age calculated by measuring the ratio of U-238 to Pb-206 in that rock. 
Pb-206 is not normally found in rocks, so we know that any amount of this substance 
must have been produced through radioactive decay. Although earlier we discussed 
some mass loss during the decay process due to radiation (Section 2.2), this amount is 
negligible when examining large quantities uranium and lead deposits. Once we can 
calculate the U-238 /Pb-206 ratio (parent / daughter ratio) we can use this information to 
infer the age of fossil remains that are embedded within the rock. 


All of these radiometric methods of dating materials are possible because the 
decay of radioactive elements is completely independent of physical and chemical 
conditions that the samples have been subjected to throughout their history. 


3.3. Radiation 


We know that something is “radioactive” when it emits “radiation” and that 
there are three types of ionizing radiation that we observe, both from nuclear reactions 
in reactors and weapons and from nuclear reactions (radioactive decay) found in 
nature. Ionizing radiation has the ability to fragment, damage molecules which it 
encounters. Fragmentation frequently results in the formation of charged species called 
ions, hence the name ‘ionizing radiation’. 


Alpha, beta and gamma radiation (Section 3.1) are all forms of ionizing radiation 
emitted from radioactive materials, but they are not equivalent in their effects. Due to 
the energy of the particles, alpha radiation causes the most damage in its path, but it is 
the least penetrating of the three types of radiation. Alpha particles will be stopped by 
a sheet of paper. Beta radiation is intermediate in its effects and it will be stopped by 
layers of clothing or by a thin sheet of aluminum foil. Gamma radiation, which is pure, 
high-energy electromagnetic radiation, is the most penetrating of the three types. 
Gamma radiation will penetrate living tissue and requires several feet of concrete or 
several inches of lead in order to be stopped (Figure 4). 
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Radiation itself does not cause the 
recipient to become radioactive. The 
radiation is absorbed and ceases to exist, 
although varying degrees of damage may be 
sustained in this absorption. Very high 


doses of radiation can cause great damage — 
cells can be damaged to the point of being 
killed; DNA can be damaged to the point of 
causing mutations in future cells produced. 
This is one way radiation can cause certain 
types of cancer. 


Paper Aluminium Lead 


Figure 4. Different levels of penetration by 
various types of radiation. 


If a source of radiation is ingested, either intentionally or accidentally, then the 
protection afforded by a layer of clothing or skin no longer exists and the internal 
organs and cells are far more susceptible to damage. For example, iodine is taken up by 
the thyroid gland quite readily. If radioactive iodine is present in a food or water 
supply and ingested, then this radioactive iodine will be taken up and will be a 
radiation emitter right in the thyroid gland itself and will cause damage. In the case of 
a leak of radioactive material from a compromised nuclear reactor, the population will 
be advised to consume non-radioactive isotopes of iodine (available from medical 
personnel) so that most if not all of the iodine taken up by the thyroid is not radioactive. 


The single largest source of radiation for most people is radon gas that can be 
trapped in our home. Radon is formed by the decay of uranium and thorium atoms, 
which are ubiquitous in soil and rock. Outside, this gas easily disperses and presents 
no danger; if it collects in basements it can build up to levels that cause unhealthy levels 
of radiation. Because it has a short half-life of 3.8 days, the energy per unit time emitted 
is quite high — compared to an element like U-238 with a half-life of 4.5 billion years! 
Even if each radon atom decay produced exactly the same amount of radiation as the 
decay of one uranium atom, the sheer total number of radon atoms decaying ina short 
time compared to the number of uranium atoms decaying in that same time period 
makes the radon far more hazardous to human health. 


Although the word radiation has a negative connotation, we are constantly 
exposed to various forms of radiation. Some other natural sources include solar 
radiation, cosmic radiation, decay of radioactive elements that are part of our 
biochemical processes such as potassium-40 (K-40), and various atmospheric and 
terrestrial radioactive elements such as C-14, tritium, U-238, U-235, etc. 


Radiation absorbed is measured in Rads (Roentgen Absorbed Dose), which is the 
actual amount of radiation absorbed. However, different types of radiation pose 
different biological dangers for humans such as cancer. A unit of one Rem (Roentgen 
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Equivalent Man) is the unit of radiation absorbed taking the different biological effects 
into account. 


We have: 
Rem = Rads x Quality Factor, 
where the Quality Factor (QF) depends on the type of radiation. 


X rays carry OF of 1, whereas alpha radiation causes 20 times more damage to 
humans and carries the QF factor of 20. The SI unit for this dose one Sievert with 1 Rem 
equaling 0.01 Sieverts. Another unit of radiation absorption is one Gray, which for X 
rays and gamma rays is equivalent to one sievert. 


Airport security and flight security are of some concern these days due to recent 
developments. Two types of scans were recently introduced: millimeter wave scanners 
(radio waves) are being introduced in some Canadian airports whereas some airports in 
the United States began to use another kind: backscatter-type machines that use low- 
level X-ray radiation to create a two-sided image. Naturally, as radiation exposure does 
cause cancer in humans, some people are worried. A typical backscatter machine 
delivers about 0.1 microsievert (uSv) of radiation. One sievert is a unit of radiation 
energy per unit of mass, namely 1 Sv equals one Joule per one kg. Therefore, a typical 
backscatter machine delivers about 107 Sv. To appease any concerns about radiation, we 
can compare the radiation exposure from one such scan to something we do quite often, 
for example, high altitude flying. Just for interest the average chest X-ray delivers 100 
microsieverts of radiation and a chest computed tomography (CT scan) delivers about 
10,000 microsieverts. 


Radiation: Verify the claim by a certain transportation security administration website 
that: The radiation dose from a single scan on a backscatter machine is the equivalent of 
two minutes of flying on an airplane. 


Solution: We find the cosmic radiation exposure at typical altitudes of commercial flights 
to be about 0.238 mrem (millirem) per hour (Health Physics Society, 2011). The 
somewhat older unit of 1 rem (Roentgen equivalent man) is used here and we have 100 
rem yielding 1 Sv or 1 rem yielding 0.01 Sv. Therefore, one hour exposure to flying at 
high altitudes delivers: 


0.238 mrem = 0.238 x 10-3 rem 
= 0.238 x 10> Sv 
= 2.38 uSv 
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= 23.8 x10-7Sv 


Thus one scan is equivalent to 1 hr/23.8 hours of flight exposure which translates to 
about 60 min/23.8 hr = 2.5 minutes of flying. So the claim is verified to a reasonable 
accuracy. 


Dental X-rays: Bitewing x-rays (4 films) deliver a radiation dosage of about 0.038 mSv. 
How many hours of high altitude flying would deliver the same amount of radiation 
exposure? 


Nuclear Radiation: High radiation areas in nuclear power plants are those areas where 
a person could receive more than 100 millirem in an hour. How many backscatter 
machine scans would that be? How many times is this radiation exposure higher than 
flying at high altitude? 


Background Radiation: The annual background radiation exposure for a typical 
American 370 mrems. How many backscatter machine scans would deliver the 
equivalent radiation dosage? 
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Chapter 4. Logic, Critical Thinking, and the Scientific 
Method 


Logic is the formal study of the principles of correct reasoning. The study of 
logic examines different forms that arguments may take, which arguments are valid and 
which are fallacious. Is logic empirical? In other words: are the laws of logic ultimately 
rooted in observations? Or are they ab initio? 


Logic is often divided into two types of reasoning: deductive and inductive. 
These can be studied formally in a philosophy course; in their more informal sense they 
are both part of science and scientific reasoning. The question “is it logical that...?” 
when applied to various fields of science - that is the interpretation of the physical 
world - is viewed from a different perspective than it would be in a philosophy class. 


Critical thinking also has a part in science. But what is “critical thinking”? 
“Critical thinking is the art of analyzing and evaluating thinking with a view to 
improving it.” (Foundation for Critical Thinking, 2006) It means: challenging 
assumptions; clearly and precisely formulating questions; drawing soundly-reasoned 
conclusions and testing them against relevant criteria; being open-minded with respect 
to alternative forms of thought, alternatives explanations and ideas; communicating 
effectively with others. 


Scientific reasoning is a form of critical thinking. Scientific reasoning: has a 
purpose, attempts to settle some question, solve some problem; is based on 
assumptions that are clearly stated, clearly justified and clearly relevant to the problem; 
is based on observations, objective evidence; is expressed through concepts and 
theories; draws inferences and gives interpretations; has specific, identifiable 
consequences. 


4.1. Inductive Reasoning 


Inductive reasoning is the process of reasoning that a general principle is true 
because the all of the special cases you've seen, so far, are true. This is where science and 
the scientific method and scientific reasoning begin. For example, if all the people 
you've ever met from a particular town have been very strange, you might then say "all 
the residents of this town are strange". That is inductive reasoning: constructing a 
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general principle from special cases. It goes in the opposite direction from deductive 
reasoning. 


Inductive reasoning is not logically ‘valid’. Just because all the people you 
happen to have met from a town were strange is no guarantee that all the people there 
are strange. Therefore, this form of reasoning has no part in a mathematical proof. 


However, inductive reasoning does play a part in the discovery of scientific or 
mathematical truths. For example, ancient mathematicians looked at triangles and 
noticed that their angle sums were all 180 degrees. After seeing that every triangle they 
tried to build, no matter what the shape, had an angle sum of 180 degrees, they came to 
the conclusion that this is something that is true of every triangle. They would have 
used inductive reasoning to write an axiom or premise to form the basis of an 
argument. Then they would have looked for a way to test it using deductive reasoning; 
that is, deduce it as a consequence of other known general properties of triangles. If 
they could, through sound principles of reasoning, arrive at a conclusion to the 
argument that all triangles have a sum-of-angles = 180, then the argument is valid. This 
was also the beginning of the field of geometry. 


4.2. Deductive Reasoning 


Deductive reasoning refers to the process of concluding that something must be 
true because it is a special case of a general principle that is taken to be true. For 
example, if you know the general principle that the sum of the angles in any triangle is 
always 180 degrees, and you have a particular triangle in mind, you can then conclude 
that the sum of the angles in your triangle is 180 degrees. 


But where do the general principles come from??? From experience! Where do 
they lead? To experiment! Deductive reasoning is logically valid and it is the 
fundamental method in testing scientific theories and is a necessary part of scientific 
reasoning. 


In summary, then: inductive reasoning is part of the discovery process whereby 
the observation of special cases leads one to suspect very strongly (though not know 
with absolute logical certainty) that some general principle is true. Deductive reasoning, 
on the other hand, is the method you would use to investigate based on logical certainty 
that the principle is either true or false. Remember the working definition of the word 
true in science — the evidence suggests that....; there is no evidence to believe it is not 
true so we will take it to be true as a starting point in further investigations. 
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Both are necessary parts of scientific and mathematical thinking. If you just 
started with the known properties of triangles and played around with them aimlessly 
using deductive reasoning, it is unlikely you would discover the fact that the angle sum 
is always 180 degrees (though if you did happen to discover it that way, you'd know it 
for certain). However, by noticing that it's true in all the examples you've ever seen, 
inductive reasoning leads you to suspect that this fact is true. Then, once your 
suspicions have given you a target and a direction for your deductive reasoning, you 
construct your rigorous logical proof using deductive reasoning. (You could, for 
example, mathematically prove that the sum of all angles in a triangle MUST be equal 
to 180 degrees). 


The hallmark of good deductive reasoning is that if we start from truths (i.e., 
things not shown to be false) and use sound reasoning, we will always arrive at truths, 
rather than at falsehoods. Deductive reasoning is said to be logical because of truth 
preservation in an argument. What is an argument? An argument is a set of 
statements, one of which — the conclusion — is supported by the others. 


What is deductive validity? An argument is deductively valid if and only if it is 
not possible for the premises to be true and the conclusion to be false. (This is part of 
the notion of truth preservation.) What does it mean to be deductively sound? An 
argument is deductively sound if and only if it is deductively valid and the premises 
are true. (This is also part of the notion of truth preservation.) 


Thus, a conclusion can be logically valid without being true, if the premises are 
false. If the premises are true, then the conclusion (soundly arrived at) is both valid and 
true. If the premises are false, then the conclusion (soundly arrived at) is logically valid 
but since the premises are not true, then the conclusions must be false. 


Sometimes we don’t know that the premises are false (we find out later) so we 
can say that the conclusion is logically valid and since we assume the premise to be 
true, we assume the conclusion to also be true. Perhaps later we find out independently 
the conclusion is not true; we are obliged to then acknowledge that the premises is/are 
false and we must re-state our premise and begin a new argument. See Textbox: John 
Snow and Cholera (Section 11.3) for a description of a real-life scenario where logic is 
credited with saving the lives of millions of people. 


Scientists often think in sets. ‘Sets’ are collections of things — objects, 
measurements, abstract notions and the like. Sets can be illustrated by Venn diagrams 
(Figure 5) and the relationships between the various sets can be logically deduced from 
these diagrams. 
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Yellow Fever: What relationships are visible in the Venn diagram (Figure 5)? Can any 
theories be proposed? Is the theory proposed a logical conclusion? Are there any 
predictions that can be made? 


Philosophers and mathematicians use truth tables to represent information and 
present arguments as propositions. A proposition is a statement that is assigned a 
“truth value” of either T or F. From the Venn diagram example, there are three 
propositions that can be written for a particular person. 


Consider a person, A, who lives in Africa: 
Person A lives in the Rift Valley — can be T of F (but not both) 
Person A has mosquito bites — can be T or F (but not both) 
Person A has Yellow Fever — can be T or F (but not both) 


For person A, there are eight possible locations in the Venn diagram and eight 
possible sets of values for the three propositions. 


Truth tables are the basis of something called Boolean algebra. In this branch of 
mathematics, a set of axioms (statements, premises) is put into a table and this algebra 
has only two numbers, 1 and 0, corresponding to the T and F of the truth tables. 
Boolean algebra is used for solving logical problems; solving problems this way 
becomes an exercise in computation. 


For example, the F and T (F = 0 and T = 1) can be the representation of a switch in 
an electrical circuit. F means the switch is open and no current flows. T means the 
switch is closed and current does flow, giving measurable electrical signal. The more 
switches, the more circuits, the more possibilities for T’s and F’s, the more premises or 
statements can be evaluated in a logical argument. Look familiar??......2" where n is 
number of ‘statements’ or ‘truth values’ or possible 1’s and 0’s - modern computing! It 
may surprise you to learn that George Boole who developed this branch of algebra to 
solve logical problems carried out his work in the middle of the nineteenth century, well 
before the modern computer was developed. 


Question for you: Do you consider a computer, science or technology? 
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In science, premises are the result of inductive reasoning. Premises are the 
starting point for a scientific argument. A premise is a general principle that is written 
down on the basis of a set of observations. A premise is assumed to be true and then is 
tested through applying arguments, the process of deductive reasoning, using the 
scientific method. Through this process, it may be shown to be not true or to be not 
untrue based on the new set of observations taken through experiment or further 
investigation. Often, the hypothesis of an experiment undertaking further investigation 
is the formal conclusion in a previous argument. If the conclusion is shown not to be 
true and the reasoning has been sound then the original premise (conclusion of the 
previous argument) must be not true. A different premise must be written using the 
original set of observations AND the new set of observations. A new experiment or 
study is conducted. 


Science is an iterative process; these iterations can be executed rather quickly as 
part of an experimental protocol where a theory is developed systematically and 
logically through care design, careful collection and analysis of data. Sometimes, 
however, these iterations can take decades or longer — theories (premises) arrived at 
through a wide range of observations (previously thought to be unrelated) and, the 
theory being very technologically difficult to test, may then be taken to be true because - 
SO FAR - data do not show otherwise. This is valid, logical reasoning. Sometimes it is 
impossible to test a theory, but with advances in technology (developed perhaps for 
completely different applications) new observations are collected that can be used to 
show the original theory (premise) is not true. The theory is then modified accordingly. 


4.3. Scientific Method 


The Scientific Method, then, isn’t a set of procedures for designing experiments 
or carrying out studies as is commonly supposed. The Scientific Method is really a 
discipline of critical thinking that subjects ideas to review and independent repetition in 
order to reduce the level of uncertainty we may have about how the physical world 
works. The scientific method is often written as a series of steps that include: 


observation - collecting and being curious about data or observations and seeing a 
question or a problem - being curious - how, why? (This is part of inductive 
reasoning — working from particular examples and supposing a more general 
principle must be true.) The observations lead to questions that need to be 
answered to satisfy curiosity. 
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forming a hypothesis - this would be the premise or premises — the beginning of 
deductive reasoning — and making a prediction - something that can be tested. 
The hypothesis is an educated guess about the answer to the question. 
Hypotheses are developed through background research into the topic of 
interest after having a “hunch” based on some interesting observations about 
the physical world. A hypothesis has to be testable and it has to be falsifiable. 
A hypothesis predicts the outcome of a particular action. A hypothesis is a 
statement, not a question. 


designing an experiment - collecting data, observations (these results are statements 
in the formal argument) The hypothesis must be properly tested in order for 
the experiment to be logically valid. 


examining and interpreting - drawing a conclusion from the results 


evaluating the results in the context of the hypothesis — the prediction made - was the 
original premise shown to be not true in the light of the experiment or study? If 
not shown to be ‘not true’, the original premise is taken to be true — so far as we 
now know. 


peer review, evaluation, publication - the work can now be examined and tested by 
others, perhaps in the light of new evidence that was not available at the time of 
the original work. The hypothesis will either be shown to be not true or not 
‘not true’ (i.e., continues to be true as far as we know). The hypothesis may be 
expanded to include a broader prediction of behaviour. 


Science does not ‘prove’ things to be true. Science can only show things to be 
‘not true’ or can fail to show they are ‘not true’. This is why science is all about 
falsification. Failure to show something is ‘not true’ is not equivalent to proving that the 
something in question is true. Science can reduce the uncertainty in our knowledge. 


Some microbiological examples are phenomena that are either observed or can 
be tested in a few days or a few weeks and could be carried out with the technology of 
the day. But, science does not require this to be so! There are other examples where it 
might at first seem that there is something that is 100% unknowable - the structure and 
working of the planet earth, the solar system, the galaxies etc. Sometimes, science 
moves forward at a pace far too slow to be carried out by one person in a single human 
lifetime worth of effort, or even by a collection of dedicated individuals in a single 
human generation. This is why science is about collaboration and multiple testing of 
ideas over long periods of time. 
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Ultimately, with copious amounts of experiments and studies on a particular 
hypothesis and failure to show the hypothesis to be ‘not true’, the hypothesis may come 
to be known as a theory. A theory is a proposed explanation of observed phenomena. 
Observed phenomena can be simple observations, measurements of individual 
properties. Observed phenomena in this sense can also mean a whole collection of 
various kinds of data - measurements of all sorts of properties - and the relationships 
between the different types of data. A theory is a proposed explanation for complex 
behaviour of any part of the physical world around us - this includes the biological 
world as well. Theories are developed in a way consistent with the scientific method, 
which is ultimately based on the rules of logic and the principles of sound reasoning. A 
scientific theory is sound if can be shown to be valid and true according to the premises 
and the evidence used in the reasoning. If the premises are true according to all 
experience so far, and the evidence has been properly collected and logically analyzed 
according to experience so far, then a theory can be proposed to explain the evidence. 


If a theory has stood the test of time and has never been shown to be a false 
explanation despite enormous amounts of evidence being analyzed, that theory may 
well become a law. The ‘law of gravity’ was once a theory. The ‘laws of 
thermodynamics’ were once theories. These laws do not mean science has proven it to 
be ‘true’, it means that centuries of robust science has never shown it to be ‘not true’. 


The word theory in science is generally reserved for those explanations that after 
numerous tests of an idea, have never been shown to be false. So a ‘theory’ is a fairly 
robust explanation; but a theory is not infallible! Theories evolve and can even be 
completely discredited in the face of new evidence. A perfect example is our present- 
day knowledge of the structure and behaviour of the planet earth. One way to help 
understand how theories develop and change over time is to examine the work of 
Alfred Wegener. Wegener was a geologist who used geological and biological evidence 


to challenge the existing belief of the time, and proposed the theory of Continental Drift. 


This theory has since been refined to the currently accepted theory of Plate Tectonics. 
Let’s examine this process in more detail. 


4.3.a. A Changing Theory 


As the beginning of WWI approached, Alfred Wegener was formalizing his 
thoughts, and on January 6, 1912, he openly announced his controversial ideas to great 
surprise at the Geological Association in Frankfurt, Germany. Then in 1915, he 
published a paper titled The Origin of Continents and Oceans. He was proposing a 
radical paradigm shift. 
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Up to this point in time, most people within the western world believed in a 
literal interpretation of the Bible where the lands were formed on the third day of 
creation and ever since, the surface of the Earth has been fixed in time and place. 
However the scientific community had already made critical observations that could 
not be overlooked, or convincingly explained, within that frame of thinking. These 
observations involved geological, paleontological, astronomical and climatic 
phenomenon around the world, and they all needed a cohesive theory to tie them 
together. To explain these observations, Wegener put forward the theory of 
“Continental Drift.” 


To support his ideas Alfred Wegener used casual observations, physical evidence 
and experimental data. His first casual observation was that continents seemed to fit 
together like a jigsaw puzzle, especially Africa and South America. Many scientists and 
theorists had noted similar observations in the past; however, Wegener went beyond 
speculation. 


Researchers were gathering physical evidence from numerous and seemingly 
independent fields, and it was Wegener that compiled many of these papers and began 
formulating a cohesive idea. For example, research showed a seaming coincidence of 
similar fossils and similar geologic materials that matched one another, but on 
continents now thousands of kilometers apart (Figure 6). This paleontological evidence 


came from people such as Edward Suess (1831-1914) who noted that fossil specimens of 


a cold climate fern, Glossopteris, could be found 
today in warm climate areas of southern Africa, 
South America and India (also Australia). 
Therefore, these places must all have been 
connected somehow in the past, and likely under 
different climatic conditions. Suess called the 
suspected single landmass Gondwanaland. 


Wegener also noted papers that indicated a 
reptile fossil, the Mesosaurus that was found in 
eastern South America and western Africa as 
though its roughly circular range had been sliced 
in half and separated. If South America and Africa 
were placed together in jigsaw puzzle fashion, the 
range boundaries of Mesosaurus would align 
(Figure 6). 
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Fossits of the fern 
is, found 


in all of the southem 


Figure 6. The historical range of Mesosaurus fossils (and fossils of other organisms). Fossil 
remains appear to link these landmasses during the late Paleozoic and early Mesozoic eras. Credit: 
U.S. Geological Survey 


Leading hypotheses at the time suggested a few possibilities to explain these 
fossil phenomena: 


1) there was a large contiguous land bridge that later subsided, 
2) stepping stone islands allowed access across the oceans, or 
3) a “rafting event’ had carried Mesosaurus and other fossils back and forth. 


None of these hypotheses answered the fundamental question of why these plants and 
animals lived only in specific geographic ranges rather than being widely distributed 
everywhere. Wegener proposed that the land itself had split apart taking Mesosaurus 
descendants off in two separate directions. 


Geologically, the observations and scientific research seemed to support intimate 
connections between continents now separated by vast oceans. If two parcels of land 
had previously been connected, then one could also safely assume that where a break 
was made in the land, one would also find matching pieces, like a broken stick being 
patched back together. Indeed, the Appalachian Mountains of eastern North America 
run northeasterly up the coastline, through Newfoundland and disappear into the 
Atlantic Ocean. Similarly, the Caledonian Mountains run through Ireland, the northern 
British Isles and across the top of Norway, Sweden and Finland; this geological 
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formation happens to match in structure and age to the Appalachian Mountains (Figure 
7). 


Indeed it was Greenland that played an important part in Wegener’s attempt at 
finding evidence for his idea. Time sequences and the understanding of how long it 
takes for certain landscapes to form were crude and rudimentary at the time. Wegener 
had thought that Greenland and Europe were spreading farther apart, which is indeed 
true. However, the rate of that movement was largely inaccurate. Today, we know it to 
be about 5 cm per year. Wegener used lunar measurements to plot the longitude of 
eastern Greenland in relation to Western Europe and noted, based on his own 
experimental data and previous measurements by others, that the landmasses were 
spreading apart. These types of measurements lead to a fundamental calculation 
problem though. Small inaccuracies in measurements (especially of celestial bodies) can 
lead to significant variation in results. 


Figure 7. Similar geological materials provided evidence for Wegener’s theory. A. These 
mountain ranges on different continents are comparable in age and structure. B. Prior to 
continental movement, these mountains aligned in one contiguous range. 


The measurements taken by Wegener fell prey to this mathematical calculation error. 
Wegener’s miniscule miscalculations suggested that continents were moving apart at 
tens of meters per year versus only a few centimetres. Hence, the continents were 
‘drifting’ or ‘plowing’ across the oceans. With no understanding of the mechanism that 
would or even could move the continents, Wegener suggested that tides and lunar pull 
had been the mechanism that was shifting the landmasses. This suggestion was 
criticized at the time, with good reason. 


But Wegener had more evidence to support his theory of continental drift. First 
and foremost a meteorologist and climatologist, it was his interests in weather patterns 
that took him initially to Greenland and lead him to investigate paleoclimatic data. 
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These data showed that the global climate had shifted dramatically in the past, with ice 
sheets being extensive about 220-300 million years ago. However, the physical evidence 
was found in places that now reside on land close to the 
equator. These are places too hot to form ice and always 
have been! Wegener suggested that if those landmasses 
had ice on them, they must have been closer to the poles 
and ‘drifted’ to the equator later, lending additional 
support to the theory of continental drift. However, the 
scientific community was not convinced. 


In the winter of 1930 on another expedition to 
Greenland, Wegener died in an attempt to bring supplies to a second camp by dogsled. 
After his death, the idea of “Continental Drift” was quietly put on the shelf while the 
scientific community looked elsewhere for plausible explanations. 


4.3.b. Development of a New Theory 


It would take almost a half of a century before the scientific community would 
accept Wegener’s fundamental premise and it wasn’t until the late 1960s that students 
began to learn about continental drift. Interestingly, Wegener’s original hypothesis was 
born out of WWI and this idea was resurrected only after technical advances from 
WWII. 


In the 1930s, the field of seismology led to the inference that the crust of the earth 
was in fact ‘floating’ on various internal layers, some of which are less than solid. There 
was no evidence to support these ideas until WWII when the US Navy began 
conducting seafloor mapping (through echo-sounding) during their voyages to various 
battles in the Pacific Ocean. 


Throughout the 1950s, oceans were being further mapped with the help of 
magnetometers. Because oceanic crust is largely basalt and basalt has high levels of 
magnetite in it, measurements could be related to the magnetic properties of the sea- 
floor. As magma cools, the magnetite aligns with the earth’s magnetic field and the 
rocks contain distinct geomagnetic striping; a kind of geologic fingerprint. 
Geomagnetic reversal, where the earth’s magnetic field shifts poles such that north 
becomes south and south becomes north, had already been recognized. However as 
data accumulated, there was a discernible pattern of magnetic striping across the sea 
floor. From the middle of the Atlantic Ocean to the shores of North America for 
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example, there was a clear pattern of magnetic striping that was a mirror image to the 
opposite side of the ocean floor. 


Using data from WWII, Harry H. Hess presented a hypothesis suggesting that 
the sea floor, split in various locations around the globe, is where new crust was 
forming and then moving laterally outwards. He called it sea-floor spreading. As 
magma comes up from the mantle, cooled and solidified, the newest material would 
then be closest to the split in the crust and the oldest material would be up against the 
continents. But in order to determine the age of the material, we needed a dating tool. 
That tool is known as radiometric dating and it allows scientists to calculate the age of 
particular materials (See Chapter 3). Using this technique on the ocean floor showed 
that the youngest rock material was indeed at the mid-ocean ridges with age increasing 
away from these areas, supporting the concept of sea-floor spreading. 


The progression of premises, arguments and conclusions that ultimately lead to 
our current theory of plate tectonics was something that arose through seemingly 
unconnected activities and observations carried out through decades and across a wide 
variety of scientific and technological fields. To summarize, let’s examine the 
development of the theory of Plate Tectonics in relation to a changing premise. 


The first theory of how physical features on the earth develop and change was 
valid, and was considered true because the premise was taken to be true (absent of any 
other info, this is scientifically reasonable). The premise was that the various features of 
the earth: the continents, lakes, oceans, etc., are all in the same spot they have always 
been; the locations of earth’s features are unchanging. The processes of erosion and 
uplifting of continental material explained the formation of different features of the 
Earth such as mountains, valleys, lakes, and grasslands. This was a perfectly logical 
conclusion IF the original premise were true. For a long time, there was no way of 
knowing whether this premise was true or not — but it was certainly not shown to be 
‘not true’. 


New information arose from new observations that were the result a wide 
variety of fossil records, mapping of the ocean floor (all data that was noticed or 
collected by others for purposes quite unrelated to the study of the structure of the earth 
and its features) - the theory could now be tested by using it as the premise for a new 
argument. Through sound reasoning, the previous conclusion (everything is in a fixed, 
immovable position on the surface of the earth and all features are the result of things 
like erosion) that was considered valid was now shown to be not true. If continents 
never moved, why did fossils found along the edge of one continent match those found 
an ocean away along the edge of another continent? The theory was highly suspect, in 
the eyes of a few conscientious scientific and critical thinkers. No one had any idea 
whatsoever HOW continents could move, features could be formed in ways never 
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before considered, but these scientists knew from the evidence that the old ideas were 
simply not a good explanation anymore. 


A new idea is proposed that was consistent with the original observations AND 
was consistent with the new observations. This is a new theory — if technology was 
available at the time, it could be tested; however, the technology required for 
experimental testing was not available. In other words, it was never shown to be ‘not 
true’. Many years after this theory was first proposed, new observations presented 
themselves from completely unexpected sources, lending further support to the theory 
of continental drift. 


Even newer, more sophisticated data were obtained (again, completely 
fortuitously), some experiments are performed where technologically possible and the 
conclusions were such that this premise (theory) is not found to be ‘not true’ and the 
evidence amassed started to suggest a mechanism for this movement of the continents 
around the globe. The theory of continental drift was modified in light of twentieth- 
century data and we now have the theory of Plate Tectonics — arrived at by almost a 
century of critical thinking by astute scientists. 


It is entirely possible that new data in the near or far-distant future may emerge 
that does not fully support this current version of the theory of Plate Tectonics. If and 


when that happens, the iterative process continues and refined or new theories emerge. 
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Chapter 5. Evaluating Information 


Never in human history has information been so accessible. But with this 
increase in the availability of information comes the need for us to be able to critically 
evaluate what we see, read or hear since many of the facts might be influenced by the 
thoughts, ideas and opinions of others. In this section, you will learn strategies that can 
be applied to various types of information to accurately determine the validity and 
usefulness of what is being disseminated. 


Whether information is being reported in the media or even if you're reading a 
peer-reviewed article, it is important to be able to determine how a particular study was 
conducted. Depending on the legitimacy of the source, you might be provided with all 
the pertinent details to be able to assess the methodology of a study, or you might need 
to do some extra research. For example, an article in the Calgary Sun might be reporting 
on a new scientific finding and instead of detailing the methods of the experiment, they 
provided the name and institution of the author of the study. On the other hand, a 
scholarly journal such as Nature might not report the methods of a study; they might 
send you to their website for supplemental information (a common practice to save on 
the cost of printing). Regardless of the quality of the source, to make an informed 
decision about the validity of an argument or premise, you need to be able to evaluate 
the experimental methodology of a study. 


So what information do we need to know? 


5.1. Sample and Population 


To evaluate the effectiveness of a study, it is important to know the sample — the 
subjects of the study. The sample is the individuals or objects that were used by the 
experimenters to collect data. The sample should be representative of the population. 
In other words, the sample should have similar characteristics to the larger group that is 
being studied (i.e., population). 


For example, if a researcher from Health Canada were investigating the 
prevalence of heart disease in Calgary, she wouldn’t talk to every single citizen of the 
city (the population), that would just take too long; rather she would investigate a 
subset (or sample) from the entire population of Calgary. In this same scenario, if this 
researcher only selected participants from a single neighborhood, do you think this 
sample would be representative of the Calgary population? 
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5.2. Sample size 


The minimum sample size that is necessary for a study to be seen as valid can 
vary tremendously. Mathematically, there are ways to calculate a minimum sample size 
that would needed to be able to falsify an experiment but this is not always done in 
many scientific studies. In many cases, sample size depends on the organism under 
investigation and in some cases the discipline. For example, a single fossil remain of 
new species of dinosaur can yield lots of important information, but should we 
discount this finding because it is based on only one sample? Definitely not! However, if 
you read a study investigating the effectiveness and side effects of a new drug, 
wouldn't you want this drug tested on more than one individual? 


This is where the peer-review process is extremely important. We can make our 
own personal judgments about sample size but experts in the field understand some of 
the constraints and issues with experimental testing (especially on humans!) and this is 
one reason why peer-reviewed literature is revered above articles that are not peer- 
reviewed. 


5.3. Collecting Data 


To be able to determine if the appropriate data were collected, you need to 
understand the purpose of the study or hypothesis. Once you know the hypothesis, 
you can then assess if the correct characteristics of the population are being investigated 
by this particular study: these are called population parameters. Since researchers can’t 
measure every individual in the population, they collect numbers or observations from 
a sample: these raw data are summarized and are called sample statistics. 


On the other hand, sometimes confounding variables might influence the results 
of a particular study. Confounding variables are things that interact with the variables 
of interest and may cause the researcher to analyze the results incorrectly. Confounding 
variables might mask true relationships or they might cause the researchers to falsely 
reject the null hypothesis. (The null hypothesis states that there is no relationship 
among the variables; this is generally the default hypothesis.) Ideally, researchers 
should try and eliminate or control for factors that might influence the results of a 
study. (Note: this is a statement about controlling factors, not about experimental controls). 
In many fields, it is impossible to completely eliminate all confounding variables, 
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especially outside of a controlled laboratory experiment, but one way to minimize their 
effect is to use a well-planned experimental design. 


Confounding Variables: Let's look at recent study. What are some confounding 
variables that the researchers should take into account? In other words, what other 
factors might influence the relationship between drinking and longevity? 


Charles Holahan and colleagues (2010) investigated whether individuals that are heavy 
drinkers (more than 2 alcoholic drinks/day) die at a younger age than non-drinkers. 
These researchers followed 1824 people between the ages of 55 and 65 for almost 20 
years and examined rates of survival for heavy drinkers, moderate drinkers (1-2 
drinks /day) and non-drinkers. Study Conclusion: After accounting for many different 
confounding variables, moderate drinkers had the lowest death rate followed by heavy 
drinkers, with non-drinkers having significantly higher rates of mortality (Holahan et 
al., 2010). 


5.4. Bias 


When we think of bias, we automatically respond with a negative connotation. 
However, bias is in everything we do, hear or say. As humans, it is impossible to 
completely separate out our own feelings and opinions from facts. This is one reason 
why the scientific approach (See Section 4.3) is so important; it provides a framework to 
maximize neutrality when investigating scientific problems. 


Just because bias is common doesn't mean that we shouldn't be able to recognize 
the bias in a situation so that we can make the most informed decision about whether 
we can accept particular information. We will review some of the more common types 
of bias and discuss techniques to minimize or avoid their influence. 


5.4.a. Selection and Participation Bias 


Constructing a robust experimental protocol is one way to avoid selection and 
participation bias. Both of these types of bias revolve around selection of the study 
subjects. Selection bias might be apparent if the researcher selects participants for the 
study. Perhaps the researcher has a conscious or subconscious preference for particular 
individuals or organisms; this could influence the results of the study. 
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On the other hand, when individuals volunteer to participate in a study, there 
might be participation bias. Individuals that select to participate might have a vested 
interest in the outcome of the study or have a preconceived notion about how the 
results should turn out, thus possibly influencing data and interpretation of the results. 
This does not necessarily mean that participants are trying to be deceptive or to ruin the 
study; they might honestly believe in a certain treatment and thus report results 
consistent with these beliefs. 


A better way to determine who participates in a study is to use a standard 
sampling protocol. The most common sampling protocols are: 


Random Sampling: Every individual has an equal chance of being represented. 
A random number table or a computer is used to generate random samples. 


Systematic Sampling: A simple sampling protocol based on a particular pattern 
of selection. For example, each 6t! person that walks down a hall might be 
selected for an interview. 


Convenience Sampling: The sample is readily accessible to the researchers. For 
example, many low-risk experiments in psychology regularly use undergraduate 
students as participants in their experiments: many of you have probably 
participated in these types of studies as part of your coursework for Introductory 
Psychology. 


Stratified Sampling: When researchers are investigating differences among 
particular groups, or strata, stratified sampling is used. This sampling protocol 
identifies subgroups of the population and then identifies samples (usually using 
random sampling, as described above) from these subgroups. An example might 
be if a researcher were interested in how anti-malarial medication affects 
different people of different races. The subgroups would be populations of 
people from different races and individuals within these groups would be 
selected for inclusion into the study based on random sampling, for example. 


Regardless of the type of sampling protocol, keep in mind that the sample should 
be representative of the larger population. If the sample is non-representative, then the 
sample likely doesn’t portray the true variability of the population. Since individuals 
vary tremendously in their responses, a larger sample has a greater chance of being 
truly representative of the population. However, there might be other factors that limit 
the size of the sample (See Section 5.2). 
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5.4.b. Confirmation Bias 


Another form of bias is confirmation bias where individuals interpret 
information in a manner that supports their hypotheses or preconceptions, or rejects 
information that fails to support their point of view. This cognitive bias can have 
significant impacts on scientific findings. Confirmation bias, also called confirmatory 
bias, is not just isolated to the interpretation of evidence; it can also influence the 
collection of data. Study participants might only remember selective details about a 
particular event due to their own preconceived notions about its effects, thus 
manipulating the integrity of data collected. 


A distinguishing feature of the scientific process is that hypotheses should be 
falsifiable. Thus, it is important for researchers to have clear experimental designs with 
controls or blinds, if necessary (See Section 5.5). This process promotes repeatability in 
science, where individuals having different views on a particular issue can test ideas 
and hypotheses on their own. Science has also adopted the peer review process that 
further controls for any influence of confirmation bias in scholarly work (See Section 
4.3.). 


5.4.c. Funding bias 


Another type of bias that is common in scholarly literature is funding bias. 
Traditionally, most research funding in science and mathematics came from government 
sources. With recent funding cutbacks, many individuals must now rely on grants from 
other sources, such as corporations or charitable organizations, to fund research 
programs. In 1965, more than 60% of all research and development in the United States 
was funded by government sources things changed and in 2006, 65% of all U.S. research 
and development was funded by private organizations (Washburn, 2007). This is not to 
say that government funding is free of bias but when for-profit corporations have a 
vested interest in outcomes, it could change the face of research. 


But what exactly is the role of these funding agencies in the scientific process? Do 
they, or can they influence the results of a study? In an idealist world, the answer would 
be no, but in reality, there have been instances where findings have been suppressed or 
manipulated by funding agencies. 


To maintain scientific integrity, it is extremely important that funding agencies 
are kept at arms length from researchers. Some publications in the primary literature, 
such as Science and Nature, detail guidelines and definitions whereby authors must 
declare any possible conflict of interest. Any conflicts are accessible to reviewers, and in 
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many instances, a statement is also added to the publication (usually in the 
Acknowledgements section). 


There are some legitimate situations where researchers do not have the freedom 
to publish. One example might be a researcher in a pharmaceutical company. When 
someone is employed by a company to investigate a particular drug, they are doing this 
job as an employee and are subject to rules and regulations of the company. Because 
pharmaceutical companies are private corporations that compete with other companies, 
there may be significant economic implications to the dissemination of information 
prior to securing patents and copyright protection. Thus, the employer may forbid the 
publication of scientific information. In this case, the funding agency has a significant 
role on the dissemination of science; something that will eventually be taken into 
account by peer-reviewers, if and when this information comes to light. 


5.4.d. Publication bias 


It is much more difficult to determine the likelihood of publication bias but in 
particular fields of study, this is of great concern. Publication bias is when a journal, 
editor or reviewers reject articles for publication because they are not new or do not 
have directional results. For example, studies that support the null hypothesis are not 
indicating any effect of a treatment or experimental manipulation. In science, you 
regularly hear (especially in laboratory courses) that no differences are still important 
results. However, when you try and publish these results, you might run into 
publication bias whereby the importance of these findings are outweighed by other 
studies that show a positive or negative effect of a treatment. Publication bias has less 
to do with the science than it does with the economics of the publication process but it 
can have significant implications for science. 


5.5. Experimental Design 


What are the characteristics of a well-designed study? First, a study must be 
repeatable. The description of the experimental design must be detailed enough so that 
another researcher has the ability to replicate the study — the researcher might replicate 
the study exactly, but in many cases the experiment is repeated using a different 
organism (study species), timeframe, dose, etc. 


The experimental protocol should also take into account the organism of interest. 
If the study organism is knowledgeable about the study, their opinions and attitudes 
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might somehow bias the results of the study (See Section 5.4). To avoid this bias, 
researchers can design an experimental study to minimize these influences; this is 
referred to as a blind or double-blind study. A blind study is where the participant is 
unaware of the true premise of the study; they are usually told that the study is 
investigating a different question, or they are not aware of a particular manipulation. 
Double-blind studies differ in that both the study participant and the researcher are 
unaware of the experimental manipulation. Obviously the researcher will be aware of 
the premise and hypothesis but they are blind in that they are unaware of whether a 
participant receives the experimental manipulation. The aim of this design is to 
minimize any influence from the placebo effect (See Textbox: The Placebo Effect). 


For example, if a researcher were investigating whether supplements of Echinacea 
prevent the common cold, one method of data collection would be to ask participants if 
they have any symptoms of a cold. Since 
Echinacea is widely taken to prevent colds and 
participants are reporting their own symptoms, 
they might be influenced by their beliefs that 
Echinacea works effectively, and report results 
accordingly. By having a blind study, the 
participant doesn’t know if they are taking 
Echinacea or the sugar pill. However, the best 
study would be double-blinded where both the 
participants and the researchers do not know 
whether the subject is receiving the supplement 
or a sugar pill. This protocol minimizes any 
spurious effects reported by the participant and 
any observations that the researcher makes, 
due to the placebo effect. 


5.6. Statistical Analysis 


Before we can do any analysis, or assess the analysis of others, we must be 
familiar with types of data that are collected. In many studies, data are described or 
interpreted in the form of statistics. To be able to describe data, we must first 
understand the data distribution of a variable. By graphing the distribution of the 
variable as a histogram, we can visualize the range of values. These distributions can 
then be discussed in terms of shape, symmetry and spread. 
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Variables that have a distribution that resemble a bell-curve with one central 
peak are referred to as having a normal distribution. Data with a normal distribution 
have several key attributes that we can use to identify this type of distribution. The 
mean, the median and the mode are very close, ideally the same value, and as a result 
we have a single mode distribution. Moreover, larger deviations from the mean become 
increasingly rare and the distribution is symmetric about its mean. 


Quite often data are normally distributed and there are reasons why this normal 
distribution is so prevalent in nature. Normal distributions arise when data are a result 
of averaging processes: genetical, environmental or anything else. There is a deep 
mathematical theorem behind this called the Central Limit Theorem. 


For example, the heights of boys in all grade five classes in Calgary are normally 
distributed. We tend to have a single modal distribution whose mean, median and the 
mode should be the same. Moreover, larger deviations from the mean are increasingly 
rare, though possible. Also, the distribution should be symmetric. 


However, the heights of children of both genders in grade five classes in Calgary 
are not normally distributed; we have a bimodal distribution as heights of boys tend to 
be higher than that of girls. Another example would be the household incomes in 
Calgary. Incomes are not normally distributed; we have a sizeable affluent population 
in our city. Thus it is not true that larger deviations from the mean are increasingly rare; 
this example shows that large deviations from the mean can be quite common. 


Once we understand this normal distribution, we can then describe how data are 
distributed around this peak. Everyone is familiar with the idea of a mean or average 
but sometimes it might be more useful to describe data in terms of median or mode 
(See Textbox: Calculating Descriptive Statistics). 
For example, if there are extreme values in a 
dataset, the median might better reflect a measure 
of centrality than the mean. Comparing the mean, 
median and mode, can give us information about 
symmetry. A variable that is symmetrical has data 
that are mirror images on both halves of the 
distribution. Data that are more spread out on the 
left side are considered left or negatively 
skewed, and data spread to the right are right or 
positively skewed (Figure 8). In a situation where 
data are skewed, the median is usually a better 
representative of centrality than the mean. 
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Outliers Left-skewed 
at low 
values 


Right-skewed Outliers 
at high 
values 


Mean | | kai al a ee 


Median Median 


Figure 8. Graphs depicting data that are left-skewed (upper panel) and right-skewed. Note the 
locations of the mean, median and mode. 


Another important consideration is the variation in a variable. Variation 
describes how widely particular scores or values are distributed (Figure 9). A dataset 
with low variation has values clustered close to the mean while a dataset with high 
variation describes the situation where data are spread out over a large range of values. 
Two variables might have the exact same mean, but the distribution can look quite 
different if they have differing levels of variation. Instead of always having to graph the 
distribution of each variable, we can use a statistical test to compute a variation score: 
this is called standard deviation. The standard deviation (usually written as o or SD) 
takes into account how much each score deviates from the mean; thus, a larger standard 
deviation indicates more variation. 


Low variation Moderate variation High variation 


Figure 9. Graphs illustrating data with different levels of variation. 


In a deterministic world, we can predict results. We can determine whether it is 
going to rain tomorrow, what will happen to the price of gold in the future, or even the 
winning lottery numbers. However, we live in a non-deterministic world where we 
cannot be sure of outcomes. Having said that, all is not lost. Even though we do not 
know what the future holds, we are still able to claim a certain probability associated 
with a given event. For example, based on the current and predicted weather patterns, 
we can estimate the probability of rain tomorrow or next week. When discussing a 
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random variable, x, we talk about the probability of the values of x falling within some 
given interval. If data have a normal distribution, we can use the intervals of standard 
deviations away from the mean to form our probabilities. 


The probability of the x value falling within one standard deviation away from 
the mean is approximately 68%. Remember that we looking at standard deviation in 
each direction (positive and negative) away from the mean so one standard deviation 
includes 34.1% on either side of the mean (Figure 10). If we consider values within two 
standard deviations away from the mean, we can talk about events with a 95% 
probability. We can even have the probability of 99.7% with variables plus or minus 
three standard deviations away from the mean. The spread away from the mean can be 
calculated mathematically; these are sometimes referred to as z-scores. 


In addition to expressing the variability of a population, standard deviation is 
commonly used to measure confidence in statistical conclusions. In science, researchers 
commonly report the standard deviation of experimental data. We expect most data to 
fall within the normal distribution, thus when performing an experimental test, it is 
unlikely that particular results that fall outside of this range of standard deviation will 
occur due to chance alone. Only effects that fall far outside the range of standard 
deviation are considered statistically significant. 


0.4 


0.2 0.3 


13.6% 


0.0 0.1 


-30 -20 -10 u lo 20 30 


Figure 10. Graphical representation indicating the distributions for one, two and 
three standard deviations (o) away from the mean (1). 


It might be easier to think out this idea in terms of probabilities. There are lots of 
different statistical tests and many of these tests calculate a test score and a probability 
value, or p value. What these tests are doing is calculating the probability that an event 
occurs, given that the null hypothesis is true. Then to make a decision, we compare this 
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probability to a prescribed level of significance (referred to as a). Typically, we use the 
significance level of 0.05 (or 5% probability). Let’s see this process in action. 


Storm Probability and Significance: Puget Sound in Washington is battered by a severe 
storm every three years on average. It widely believed the occurrences of these storms 


are independent of each other and random. However, over the span of last three years 
this region was hit with the storm each year. Is this enough evidence to reject the null 
hypothesis of independence and randomness of these storms? 


Solution: We can never be 100% sure but we can set a significance level and test the 
hypothesis. Let’s assume that storms are random and independent, this will be our null 
hypothesis. Then, how likely is the run of three storms every year? What is the 
probability, p, of this occurrence? Every year the chance of the storm hitting in the first 
year is 1/3 and the probability of the storm hitting the second year is still 1/3, and the 
same for the third year. When combining probabilities such that the probability of event 
A and event B have an equal chance of occurring, we use p(A and B) = p(A) x p(B). Thus, 
the probability of three storms in a row is given by 


p(A and B and C) = p(A)x p(B) x p(C) 


This probability (called the p value) can be thought of as a percentage; we have p 
= 3.7%. It is a low percentage but it still can happen. Using the significance level of 0.05 
(or a 5% chance), we can compare our probability calculation. We find that 3.7% < 5%. 
What this means is that the probability of three storms in a row, given the null 
hypothesis being true, is less than 5%; thus, we say that are results are statistically 
significant (unlikely to have happened by chance) at a = 0.05. As a result, we reject the 
null hypothesis and we no longer believe the storms are independent and random. 
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If we are selecting a particular level of significance, how do we know what is 
appropriate? Is a 99% level of significance better than a 95% level of significance? To 
answer these questions, we must understand the concept of Type I and II errors. 


5.6.a. Type I and Type II errors 


Suppose we have a situation where we are asked to believe a certain proposition. 
Such a proposition might be a certain belief of the scientific community, a known 
reliability of a specific car, or the innocence of a defendant. We collect data and ask the 
following question: based on the data collected are we going to reject the null 
hypothesis? We are not estimating anything; all we are doing is determine if there 
enough evidence in the data to reject the null hypothesis. 


Take the assumption of innocence for a defendant (null hypothesis) as an 
example. There is a trial and evidence is presented. The evidence represents data we 
collected and we are asking the same question: is there enough evidence to reject the 
null hypothesis of innocence? In other words, is there enough evidence to declare the 
defendant guilty? We can never be 100% sure of a persons guilt, just as we are not 100% 
sure of anything in science, but we rely on the weight of the evidence to be able to 
reject the null hypothesis. For example, there might be overwhelming evidence against 
the defendant and yet the defendant is innocent. He/she can be at the wrong place at 
the wrong time, there might be some weird coincidence and so on. If we reject the null 
hypothesis (declaring the defendant guilty) while defendant is innocent we are 
committing a Type I error. To guard against such an error we can lower the significance 
level. Lowering the significance level amounts to requesting more evidence against the 
defendant. 


As expected there is a major drawback to all of this. As we lower the significance 
level (as we lower the chance of committing the Type I error) we are running a risk of 
letting a guilty person go free. This type of mistake is called the Type II error. The Type 
II error occurs when we erroneously fail to reject the null hypothesis; when we fail to 
declare the defendant guilty when in fact he/she is guilty. No matter what we do, we 
can never be 100% sure and moreover, the more we try to avoid a Type I error, the more 
likely we are to commit a Type II error, and vice versa. 


In our example of storms in Puget Sound (See: Storm Probability and 
Significance), could we have committed a Type I error? Yes, of course, we could have. 
But once we set the significance level, we also set the probability of committing Type I 
error and we know it is less than 5%. (The probability of committing Type I error is 
equal to the significance level.) Suppose we really fear committing a Type I error and 
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we lower the significance level even more. Now we are more likely to cling to a belief 
that the storms are random and independent. However, we are also now running a 
higher risk of committing a Type II error — failing to reject the independence and 
randomness of the storms when in fact the storms are not independent and random. 


If we really want solid evidence to reject the null hypothesis, we might set the a 
value (the significance level) to be really small, say a = 0.003, which corresponds to 
0.3%, or better yet three times out of a thousand. The choice of 0.3% is typically selected 
because it relates to properties of the normal distribution; it includes three standard 
deviations on either side of the mean (Figure 10). 


In our Puget Sound example, we no longer reject the null hypothesis at this tiny 
level of significance. The occurrence of three storms in a row is no longer statistically 
significant at this level. We hang on to the belief that the three storms in a row 
happened by chance and they are still random and independent. The Type I error is 
definitely avoided. However, now we fear committing the Type II error much more than 
before! 


Saskatchewan Tornadoes: Saskatchewan is battered by about 15 tornadoes each year. 
But intense tornadoes (F3 or higher), which are capable of ripping the roof off a house, 
occur only every 11 years. It widely believed the occurrence of these tornadoes is 
independent of each other and random. However, Saskatchewan was hit two years in a 
row. Is this enough evidence to reject the null hypothesis of independence and 
randomness of these tornadoes? 


5.6.b. Statistical Tests 


In performing analysis of data, we do more than just calculate probabilities. 
These types of analyses require a different type of analysis but they follow some of the 
same principles. A common test to determine differences between two groups / 
treatments is called a t-test. When comparing more than two groups (three or more), 
we use a test called an ANOVA. An ANOVA is actually an acronym for Analysis of 
Variance — exactly what we are looking at! When these comparative tests are performed, 
we are examining the distributions of the two (or more) groups. For a test to be 
statistically significant, we are looking for a 5% or less overlap (assuming a = 0.05) 
between the distributions. In other words, p < 0.05. Since both of these tests rely on 
examining a variables distribution, one rule for these tests is that the variables of 
interest must have normal distributions. In statistics, we call these tests parametric. 
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There are many other types of statistical tests that can be performed when data 
are not normally distributed (such as the Wilcoxon Rank-Sum test, Mann-Whitney U 
test, Smirnov test, among others); these analyses are called nonparametric statistics. As 
you take other courses, you'll learn more detail about these tests included the 
mechanism to calculating these statistics. At this point, however, it is important that you 
understand how to interpret the results (p values) from these tests. 
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Chapter 6. Manipulating Scientific Information 


Our earlier discussion of how a theory can change over time (Section 4.3.a) 
highlights how science is a work in progress; it is constantly adapting and changing in 
light of new evidence, ways of thinking and technological advances. Because of its 
changing nature, scientific conclusions, although reliable, are tentative. They could be 


subject to change at any time. Don’t worry - the world is not going to change overnight. 


Many ideas have survived repeated challenges and yet still remains as the prevailing 
scientific belief. This is not to say that we shouldn't have confidence in our scientific 
knowledge but we do need to recognize that skepticism and challenges to ideas is the 
norm. 


6.1. Correlation and Causation 


Can this tentative nature of science be misconstrued and even manipulated? Yes, 
and in fact, it has in many ways. For example, why did it take so long to regulate 
cigarette smoking? The scientific community compiled numerous studies showing a 
relationship between cigarette smoking and cancer. Why did the public not accept this 
evidence? One of the main reasons was the great marketing campaign waged by the 
tobacco industry. They were able to exploit the tentative nature of science and persuade 
the public that there were no concrete conclusions. Much of their rhetoric dealt 
specifically with the language of scientists. Scientists were showing relationships; they 
couldn’t show cause and effect. However, unless a scientist is able to control all 
confounding variables (see Section 5.3.), they are not going to claim that variable x 
causes y, and thus, they temper their language to reflect these findings. Scientists talk 
about correlations, relationships, and indicators; this doesn’t mean that they are 
waffling on their ideas — they are respecting the scientific process and their numerical 
analysis. 


Think about it. Can you come up with an experiment on humans that would indicate a 
causal (cause and effect) relationship? 


So how do scientists show relationships between variables? Scientists use the 
principles of correlation. Using a statistical test, scientists can calculate a correlation 
coefficient that indicates the strength and directionality of the relationship. A positive 
relationship has a correlation coefficient (r) that approaches 1.0 and a negative 
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relationship is indicated by a coefficient that 
approaches -1.0. A perfect correlation would be if 
all data points lie on a straight line — this would 
be a perfect positive (1.0) or negative (-1.0) 
relationship — although this is rare in science. If no 
relationship exists between variables, the 
coefficient is 0. 


There is temptation to infer cause and 
effect, especially when looking at a graphical 
representation of a correlation, but keep in mind 
that there may be other explanations for the 
relationship. For example, a relationship might 
not truly exist; both variables might be influenced 
by acommon underlying cause. Alternatively, 
there might be multiple causes for the relationship 
and we are just seeing one small portion. Finally, 
these data might be purely coincidental. All of 
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Correlation: The above fictional figure 
has a correlation coefficient of r = 0.80. 
What type of relationship is depicted? 
Write a sentence to explain this figure 
if the X variable is the number of days 
of drinking chamomile tea and the Y 
variable is the mean number of hours of 
sleep in a night. 


these reasons highlight the importance of repeatability in science. 


6.2. Using Percents 


Probably the most common method of distributing information is through the 
use of percents. News reports, government data and even our everyday speech 
commonly reference percents as a way to discuss information. But look at any of these 
sources, and you'll likely find inaccuracies. For example, how many of you have said 
that you’ve “given 110% effort into something” — studying for a test, perhaps. Think 
about it - what is 110%? The maximum effort you can put into anything is 100%. Thus, 
it is impossible to give more than the maximum amount. 


We use percents to discuss a change in an object, or to compare among objects. 
We are all familiar with how to calculate percents but in many cases (sometimes by 
mistake, other times on purpose), these calculations can be performed incorrectly. To 
calculate an absolute change in percent, we are calculating an increase or decrease from 
some reference value to a new value. A relative change is reflecting the size of an 


absolute change in comparison to the reference value. 
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absolute change = new value - reference value 


: absolute change 
relative change = ——————_—- 
reference value 


Let’s look at an example to illustrate the difference between absolute and relative 
change. 


Calgary Population Growth: In 2006, the population census found that the population 
of Calgary was 991,759 people. By the recent 2011 census, the population of Calgary had 
reached 1,090,936. Describe the absolute and relative change from 2006 to 2011. 


Solution: We have a reference value of 991,759 people and a new value of 1,090,936. To 
calculate the absolute change, we use 


absolute change = new value - reference value 
= 1,090,936 — 991,759 
= 99,177. 

To calculate the relative change, we use 


. absolute change 
relative change = ————_—- 
reference value 


_( 99,177 
991,759 


= 0.1000 


Thus over a 5-year period, the Calgary population increased by 10%. 


As you can see, the reference value is important in both these calculations and an 
incorrect reference value will provide incorrect results. Let’s again take our example of 
Calgary population growth. If we used the WRONG reference value, let’s say we used 
the new population size, we would have 
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. absolute change 
relative change = —————__— 
reference value 


_( 99,177 
~ \ 1,090,936 
= 0.0909 


Although this is only a 1% difference from our correct calculation (9% versus 
10%), you can see how this difference can be amplified depending on the situation. The 
population census is used for many purposes, one of which is getting resources. Our 
population growth defines how much federal funding is allocated for healthcare, road 
construction, transportation, etc. Thus, a small miscalculation can have major societal 
implications. 


6.3. Visual Presentations 


Another way that data can be misinterpreted is through the visual display of 
information. In your own lifetime, you have likely seen countless graphs or figures that 
have skewed information in such a way that makes you misinterpret results. Because of 
the wide variation in how data can be misrepresented, it is easier to learn the basic rules 
of the correct ways to display numerical information. 


The caption of your figure or table should have all the necessary information that 
is needed to be able to interpret your findings. Think about it in this way, if you only 
read the caption, could you understand the figure? Another rule of thumb is that 
captions should be above tables but below figures. 


The axes should be labeled with the name of the variable along with the units of 
measurement. Axes should be consistent in terms of scale; you never vary the distance 
or change your scale part way through a representation. A common misconception is 
that your vertical axis (Y axis) always starts at zero — this is incorrect. It might be easier 
to show small trends by adjusting the scale accordingly. This is perfectly appropriate 
but you need to be cautious that you’re not being deceptive in your presentation. 
Another thing to watch out for is that the scale is numerical and not logarithmic (or 
exponential). Logarithm scales correspond to a power of ten; in other words, a 10-fold 
increase with each increment. If you don’t recognize these differences, you might have a 
very different interpretation of the findings. 
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Any data points should be clear and concise. Sometimes authors will present 
data in the form of boxplots, histograms, or datapoints, but data should never be 
presented as images, pictures or other artwork. Graphs that are adorned with images 
are referred to as pictographs and might be cute, but they are an inappropriate way to 
present information. Visual representations of information are supposed to provide an 
easier way for the reader to interpret the findings. Cluttering a figure with artwork 
might distort the true findings by altering perception, distracting the reader, or even 
masking the true findings. In addition to pictographs, all visual representations should 
be consistent. For example, you shouldn’t have bars of a histogram being depicted in 
different widths if all they are indicating is a frequency. Different widths can trick the 
reader into misinterpreting findings. 
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Examining Natural Disasters 


Now that we have learned some fundamental principles about numeracy and 
science, let’s examine some examples of natural disasters that we commonly hear about 
in our everyday lives. The Earth influences every aspect of our lives so it is important 
to be able to understand what's going on around us, the consequences of these actions, 
and how it natural disasters have shaped human history. 


One of the most dangerous threats to a society is a natural disaster. Floods, 
earthquakes and volcanoes have destroyed entire societies, within minutes, and many 
are difficult, if not impossible to predict. In this chapter, we will consider natural 
disasters — what they are, how we learn from them, and how we can take that learning 
to mitigate risk to human populations. We will also look at how they change our 
perspective of our world — something that we often see as static and non-changing, until 
disaster strikes. 


What does it mean to be literate in the Earth Sciences, especially in the context of 
natural disasters? It means understanding enough about the underlying causes of 
natural phenomena to be able to assess the type of risk that may be associated with a 


place. It is about understanding what can be measured, and what different 
measurements mean (for example, various scales of measuring earthquake strength). A 
literate person knows what can be predicted with scientific methods, and what cannot, 
and how to interpret probabilities. It means having a critical sense of what is factual 
information and what is conjecture. 


One foundation of this literacy is the understanding of how the earth and natural 
disasters are studied. There are direct methods, in which physical contact is made — for 
example, sampling rocks, and sediments, taking atmosphere and water samples. 
Indirect methods also provide information, but are based on interpretation, 
interpolation, and inference. Examples of indirect methods include mapping with 
satellite images or reading seismic profiles. With natural disasters, timing is also an 
important consideration in data collection. We often are left collecting data after the 
event, and trying to piece together bits of information we think are important. From the 
data gathered either during an event or afterwards, models are made to help predict 
how Earth’s systems change in response to human or natural pressures. Results of those 
models help to assess risk, inform government and environmental policy, and 
investigate ways of mitigating or forecasting the hazard to protect human lives and 


property. 
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Chapter 7. Earthquakes 


In foresight, not hindsight, the course of history often looks predetermined. But a 
natural disaster can alter that course without warning. At the beginning of the 15th 
Century for example, the Portuguese had leaped ahead of their European neighbours 
technologically. With intense study and experimentation in the fields of Astronomy and 
Geography, they managed to develop technological innovations that allowed them to 
literally set out to sea. Determining a ship’s location out in the open ocean beyond the 
sight of land to this point had been an exercise in rough guesses and speculation. With 
the introduction of improved navigational devices like Quadrants along with vast 
improvements in ship and sail design, the Portuguese seafarers set off to explore the 
uncharted oceans, and to find riches. Their Spanish neighbours quickly adapted the 
new maritime technologies and launched into an open and often fierce competition 
with the Portuguese. While the Chinese in the East had likely been ahead by fifty years 
in global ocean voyages of exploration, the European nations sailed off on voyages not 
of exploration, but of discovery and acquisition. Within 300 years the Portuguese and 
Spanish had claimed large areas of South America and Central America. The British, 
French, Dutch, and Germans were well behind. It seemed as though Spain, and 
certainly Portugal, would become the dominant ruling powers for at least the next 300 
years as well. 


On Saturday, November 1, 1775, an earthquake struck 200 km off the southwest 
coast of Portugal. Estimated at 8.5-9.0 on the Moment Magnitude Scale, the force easily 
toppled most of the stone and masonry buildings in the coastal, and capital city of 
Lisbon. Fearing the crumbling debris among narrow urban passageways, many people 
made their way to the open areas of the shoreline and wharfs. Their relief lasted only 40 
minutes before they were confronted with earthquake induced tsunami waves 12 m 
high. Fire also broke out in the mayhem that engulfed what remained of the city. The 
final death toll was approximately 50,000 by some accounts in a city that had 200,000 
people that morning. 


As an administrative centre, with seventy to eighty percent of Lisbon’s buildings 
destroyed, the necessary record keeping and clerical expertise involved in maintaining 
and expanding an empire had suddenly vanished. They became the only European 
country to temporarily move its capital to one of its colonies. That colony, Brazil, 
remains the only country today in the Western Hemisphere to have its official national 
language as Portuguese. 


If it weren’t for the ‘Great Lisbon Earthquake’ of 1775, it leaves one wondering 
what language we might be speaking today in places like Canada and the United 
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States? It may also open our eyes to speculation about a future that may seem inevitable 
today, but in reality, the future is more often than not, a strange place. 


7.1. Definition 


In a previous section, you learned that the Theory of Plate Tectonics describes 
how the Earth’s crust is composed of massive plates or slabs that move in response to 
the warmer mantle beneath (Section 4.3.a). Where these plates meet are areas of intense 
tectonic activity. There is a lot of friction between the plate edges, and when they do 
suddenly move past each other, an immense amount of energy is released. This energy 
is felt as vibrations in the form of an earthquake. The release of tectonic energy is much 
like when you snap your fingers together. The release from the friction of your skin 
causes your finger to slip, striking the skin on your hand. This action generates sound 
waves, which then travel through the air, much like how seismic waves travel through 
the ground. 


7.2. Earthquake Energy 


Earthquake energy propagates or moves in waves, both through and on the 
surface of the Earth. Body waves travel through the Earth in two forms (Figure 11). The 
first is the primary, or P wave. These waves are the fastest and are the first wave of an 
earthquake to be felt and recorded. They move much like sound waves, as 
compressions, in a push-pull action. P waves can move through any substance, since 
solids, liquids and gases can all experience compression and expansion, and can have 
particles all moving in the same direction as the wave propagation. The secondary, or S 
waves follow. In these waves, the particles move side-to-side, or up and down, 
perpendicular to the wave front. Because there is movement in two directions at once 
(forward and sideways), S waves cannot move through liquid or gaseous materials. 
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Figure 11. Surface waves travel along the surface of the earth and can have two different forms. A. 
Rayleigh waves move in a rolling manner. B. Love waves shift the ground in side to side motion; 
an effect that can be devastating to buildings and other non-flexible structures. 


There are also surface waves that move along the Earth’s surface (Figure 12). 
Rayleigh waves move in a rolling fashion, like ocean waves curling backwards, and 
Love waves move in a perpendicular, side to side motion. Building stability is more 
threatened by sideways ground movement and shaking compared to up and down 
movement, so the Love waves are the most potentially destructive. 
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Figure 12. Types of seismic waves and the motion they generate. A. P-waves are compression 
waves and can compress and expand materials. B. S-waves shake the ground perpendicular to 
their direction of travel and produce up and down movement of the ground. 
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7.3. Earthquake Source 


Earthquake energy radiates outwards from the earthquake origin, called the 
focus, or hypocenter. We often refer to the epicenter as being the source of the 
earthquake, which is the point on the Earth’s surface immediately above the actual 
focus (Figure 13). 


There are two key relationships that aid in our understanding of an earthquake. 
The first is that as energy moves away from the epicenter, it dissipates, or lessens. The 
second is that the farther away from the epicenter, the greater the time delay between 
the fast moving P waves and the slower S waves. Knowing these relationships, and 
with the ability to detect and measure earthquake energy around the world, it is 
possible to determine: the location of an earthquake epicenter, based on measuring the 
timing and strength of the P and S waves at three measurement locations, and the 
strength of the earthquake, based on the magnitude of the vibrations felt in relation to 
distance. 


7.4. Measuring Earthquakes 


Since earthquake energy 
manifests in the form of vibrations, an 
instrument that is sensitive enough to 
can record minute movements of the 
ground is used to detect and measure 
earthquakes. A seismometer (seíð, to 
shake; métron, measure), also referred to 
as a seismograph (seismo, to shake, and 
graph, to write), is based on the principle 
that a suspended weight will remain 


inert while the ground moves, and the Figure 13. The focus is the location where the initial 
differential movement between the displacement occurs and the epicenter is the surface 


shaking ground and thetnert weight location directly above the focus. 


will be recorded. There are over 125 

seismograph stations run by the Canadian Government across the country, constantly 
monitoring ground movements. Records of seismic activity are called seismograms, 
and they indicate the time between movements, as well as the strength of the 
movements, reflected in the height or amplitude of the markings. 
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Seismographs can also record other vibration-causing activities, such as mining 
explosions and nuclear events, but the signals produced by these events have a different 
pattern than earthquakes. In the seismogram below (Figure 14), three characteristics of 
an earthquake are apparent. The first is the P wave, arriving long before the other 
waves. The S wave is next, with the longer marks away from centre, indicating that it is 
stronger, followed immediately by the surface waves, which are the strongest. 
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Figure 14. A seismogram records the early arriving P waves, followed by S waves, and then surface 
waves (both Love and Rayleigh waves). The time interval between these waves are measured so that 
we can determine the location of the epicenter. 


7.5. Locating an Earthquake 


P waves travel at higher speeds than S waves. Typically the velocity of an S wave 
is about 60% of the P wave velocity. Therefore P waves arrive at the measuring station 
earlier than S waves. The time delay between the beginnings of the P and S waves gives 
a measure of how far away the earthquake was located. We don’t know in which 
direction but we can calculate the distance. 


Earthquake Distance: Based on this time delay, we can estimate the distance of the 
epicentre of the earthquake from the measuring station. Consider the following 
simplified model. Suppose there was an earthquake with an epicentre d km away from 
the measuring station. The resulting P wave arrives at a speed of 5 km/s at the station 
and the resulting S wave arrives at a speed of 3 km/s. The time delay between these 
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wave arrivals was At = 10 seconds. (The delta symbol is used to represent a change in 
something; in this case, the change in time.) Calculate the distance d of the epicentre 
from the measuring station. 


Solution: Let d denote the distance of the epicentre of the earthquake from the 
measuring station. Let vp= 5 km/s be the speed of the P wave and let tp denote the time 
in seconds for the P wave to arrive at the station. We have 


Similarly let vs= 3 km/s be the speed of the S wave and let ts denote the time in seconds 
for the S wave to arrive at the station. We have 


Usts = d 
Note t:> tp. Now the time delay between the arrival times is given by: 


At=t,-t 


S P 


=75 


Therefore, the epicentre of the earthquake is estimated to be about 75 km from the 
measuring station. 
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From the above example, the earthquake occurred 75 km from the measuring 
station but we still don’t know in which direction. So how does this distance 
measurement help us determine the location of an earthquake? 


When we have distance measurements from three different locations, we can use 
the process of triangulation to pinpoint the epicenter. Triangulation works by drawing 
circles at the appropriate distance from the monitoring station. The location of the 
epicenter is where the three circles overlap (Figure 15). With only two circles (i.e., two 
distances), there would be two overlap points, and more investigative work would be 
required to determine the location of the epicenter. 
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Figure 15. Using distance measurements from three different measuring 
stations (Paris, Nagpur and Darwin), we can use triangulation to determine the 
epicentre of the earthquake. This is the point where all three radii overlap. 


This is the same principle used with GPS receivers to locate our exact position on 
the Earth’s surface. The receiver first determines how long it takes for a signal from 
different satellites to reach it. Secondly, it calculates the distances between it and each of 
the satellites, and finally, knowing their positions in orbit, calculates the spot where all 
the distance measurements overlap. 


81 


7.5.a. Significance of Location 


By plotting out the location of earthquake locations around the world, a direct 
relationship to the major tectonic plates becomes apparent (Figure 16). The majority of 
the world’s earthquakes are situated along the plate boundaries. When earth scientists 
began to realize that the depth of earthquakes also had a pattern (they are clustered 
along converging plate boundaries), they realized that something more complex was 
occurring along those boundaries. Earthquakes were another line of evidence that 
supported the Theory of Plate Tectonics. 


Figure 16. Locations of earthquakes having magnitudes ranging from 5.5 to 9.5 for the time period of 
1900-2010. Dots and triangles indicate earthquake locations and yellow lines depict plate boundaries. 
Credit: Earthquake Hazards Program, U.S. Geological Survey. 


Earthquakes occur where plates intersect and move against each other, or split 
apart (Figure 17). Three different scenarios are: 
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Divergent: the two plates move away from 
each other. Examples include Iceland (Mid- 
Atlantic Ridge), and the Great Rift Valley in 
Africa. 


A. Divergent boundary PS 


Convergent: two plates move towards each 
other. If one plate is more dense than the other 
(i.e., oceanic crust vs. continental crust), it will 
descend below the other, eventually reaching 
depths of around 100 km before melting and 
becoming part of the mantle again. It is in this 
“subduction” zone where deep earthquakes 
occur. 


Transform: the plates slip sideways past each 
other. An example is the San Andreas Fault in 
California, where the Pacific Plate is moving 
north relative to the North American plate. 


C. Transform fault boundary Z 


Figure 17. The three types of plate boundaries. A. Divergent boundary. B. Convergent boundary. C. 
Transform boundary. 


Sudden movements of the plates in any of these configurations will cause 
earthquakes. Though the plates could be in constant motion, friction causes the 
plates to “stick”, building up elastic energy. The plates eventually snap to their 
desired location, and this causes the quake. In Canada, the areas of greatest 
tectonic activity are the west coast near Haida Gwaii and Vancouver Island, where 
the Juan de Fuca plate is descending below the North American plate, and the St. 


Lawrence Valley, which is one of the few 
areas on Earth where earthquake activity is 
not associated with a plate boundary (Figure 
18). Fault lines are linear features where the 
movement manifests and appears as a scar 
on the surface. 


7.5.b. Intensity and Magnitude 


We often refer to earthquakes in terms of 
size — a “big” one, or a “small” one. But what 
does that mean? Does size refer to the strength, 
the amount of area affected, or the damage 
done? There are different scales that measure 
earthquake “size” — either intensity or 
magnitude. 


Intensity measurements represent the 
amount of shaking, based on damage that 
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Figure 18. Interactions among different 
plates along the Pacific coast of North 
America. Credit: U.S. Geological Survey. 


occurs. In the Modified Mercalli Intensity Scale, the scale is based on qualitative 
observations of damage to buildings built to standard codes and practices of North 
America, specifically California (Table 4). The drawback of course, is trying to use this 
scale away from that baseline. For instance, in an area where buildings may not meet 


California standards, more damage 
would occur, and appear at a higher 
intensity value, making comparisons 
between earthquakes in different locales 
difficult in inappropriate. This scale is 
useful though, as a means of classifying 
on a map how various parts of a region 
were affected by a quake (Figure 19). 


Collecting data for the Modified 
Mercalli Intensities is a slow and 
cumbersome process. Citizens are asked 
to respond through mail about 
observations they had during and after 
the earthquake - the shaking that was 
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Figure 19. Levels of damage sustained after a 8.1 
magnitude earthquake struck Queen Charlotte 
Island, BC. Levels based on the Modified 
Mercalli Intensity Scale. 
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felt, and the damage noticed. Additional field observations are made by emergency 
response personnel, engineers and even media reports. Compiling, analyzing and 
interpreting the data can take months for large earthquakes, but it is challenging, if not 
impossible, in remote areas where infrastructure and residents are minimal or non- 
existent. 


Table 4: An abbreviated description of the 12 levels of Modified Mercalli Intensity Scale 
(USGS, 2009). 


Level Characteristics 


I I. Not felt except by a very few under especially favorable conditions. 
II II. Felt only by a few persons at rest, especially on upper floors of buildings. 


Felt quite noticeably by persons indoors, especially on upper floors of 
buildings. Many people do not recognize it as an earthquake. Standing 
motor cars may rock slightly. Vibrations similar to the passing of a truck. 
Duration estimated. 

Felt indoors by many, outdoors by few during the day. At night, some 
awakened. Dishes, windows, doors disturbed; walls make cracking sound. 
Sensation like heavy truck striking building. Standing cars rocked 
noticeably. 

Felt by nearly everyone; many awakened. Some dishes, windows broken. 
Unstable objects overturned. Pendulum clocks may stop. 

Felt by all, many frightened. Some heavy furniture moved; a few instances 
of fallen plaster. Damage slight. 


VII Damage negligible in buildings of good design and construction; slight to 
moderate in well-built ordinary structures; considerable damage in poorly 
built or badly designed structures; some chimneys broken. 

VI Damage slight in specially designed structures; considerable damage in 
ordinary substantial buildings with partial collapse. Damage great in poorly 
built structures. Fall of chimneys, factory stacks, columns, monuments, 
walls. Heavy furniture overturned. 


Damage considerable in specially designed structures; well-designed frame 
structures thrown out of plumb. Damage great in substantial buildings, with 
partial collapse. Buildings shifted off foundations. 
Some well-built wooden structures destroyed; most masonry and frame 
structures destroyed with foundations. Rails bent. 

XI Few, if any (masonry) structures remain standing. Bridges destroyed. Rails 
bent greatly. 


XII Damage total. Lines of sight and level are distorted. Objects thrown into the 
air. 


Canada has an initiative to collect intensity data include using the internet and 
social networking tools. Following an earthquake, the Community Internet Intensities 
Mapping (CIIM) program creates and posts a map of intensity, based on the damage 
information provided by local residents, through online questionnaires. The data is 
compiled and presented by postal codes, and each postal code area is colour-coded to 
represent the appropriate intensity value. Frequent updating and dissemination on a 
web platform ensures that the most up-to-date information is available to the general 
public, and is an accessible record of the impact of the earthquake. 


Magnitude describes the relative amounts of energy released in earthquakes. In a 
seismogram, magnitude is represented by the amplitude (height) of the waves. Once 
distance to the epicenter is taken into consideration, the amplitude measurement is 
“normalized” to estimate the energy released at the source of the quake. This is a 
quantifiable value that can be compared to other quakes, anywhere in the world. 


The Richter scale is probably the most well known of the magnitude scales. 
Charles Richter developed it in 1935, and intended for it to only be used to measure 
earthquakes in his local area (California), hence the symbol Mı (magnitude local). 
However, because it is useful in providing a single value to describe an earthquake, and 
the ease of comparing earthquakes from different geographic areas and times, it has 
been widely adopted around the globe. 


The Richter scale expresses two characteristics of an earthquake, the amount of 
ground shaking and the energy released. The amplitude of the largest wave on the 
seismogram is used as the primary measure. Using that value, and the S-P interval (time 
between the arrival of the P wave and the S wave, which provides a measure of 
distance), a mathematical formula is applied to derive the magnitude value. This can be 
done graphically as shown in Figure 20. 


Mz is determined by plotting the amplitude value on the right scale, and the time 
interval on the left scale, and drawing a line between. For earthquake A in Figure 20, it 
is apparent that an M, 3.5 earthquake could be designated for this seismic event. 
Earthquake B has the same time interval but its amplitude was much higher (stronger 
waves). Thus, this earthquake is designated as a Mı 4.5. The benefit of the Richter scale 
is that it provides an absolute value of magnitude for the energy at the source, not just 
at the measuring station. The Mı values start at zero, for no energy released, and while 
there is no upper end, there have been no recorded earthquakes in the order of Mı 10. 


The magnitude amounts represented by the Richter Scale have an enormous 
range, that is to say that the strength of an earthquake can vary between a tremor that is 
not even felt by humans (M; 0) to a cataclysmic event equivalent to 1 billion tones of 
TNT (Mz 9). In order to capture this range a logarithmic scale is used, instead of a 
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linear scale. Each value on the Richter scale represents a 10-fold (101) increase in ground 
movement or displacement. Using the logarithmic scale, an earthquake with a M, value 
of 8 is 10 (or 10!) times stronger than an earthquake with a value of 7. An earthquake of 
Mz 6 is 1000 (or 103) times more powerful than an earthquake of M13. If a linear scale 
were used, a value of 6 would indicate that it was twice as powerful an Mz3. 
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Figure 20. Determining an earthquake magnitude based on distance and amplitude of body waves (P 
and S waves) using the Richter magnitude scale. 


When we talk about actual energy released, the difference is more extreme. On 


the Richter scale, each value or step represents an approximate 32-fold increase of 
energy released (1015). An earthquake measuring M_6 is roughly 32,000 times more 
powerful than an earthquake of M:3, not double as a linear scale would suggest. 
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Therefore, an understanding of the scale used is critically important for anyone talking 
about, and especially comparing, earthquakes! 


Earthquake Magnitude: On March 11, 2011 an earthquake of magnitude 8.9 on the 
Richter scale has struck Japan. Later on the Richter scale reading of this earthquake was 
revised to be 9.0. When this revision was made what was the resulting percentage 
increase in the energy released? 


Solution: To solve this question we rely on the Richter Gutenberg Seismic Method that 
states: 


E =1018+15R 


where E is the energy (in some units) released by an earthquake and R is the Richter 
scale reading of the earthquake. The value 11.8 is a constant that is included in the 
equation. The energy released by an earthquake of magnitude 9.0 is given by: 


Eo.0= 1011.-8+(1.5)(9.0), 

and the energy released by an earthquake of magnitude 8.9 is given by 
Eso = 10)11-8+(1.5)(8.9), 

The energy ratio between these earthquakes is then given by 


E 10118+1590) 


9.0 


E 


= 11.8+(1.5)(8.9) 
8.9 10 


= 10152-89) 
= 10° 
= 1.41 


Thus, the 9.0 magnitude earthquake would release about 1.41 times more energy than 
the 8.9 magnitude earthquake. In percentage, we would say that the 9.0 earthquake 
releases about 41% more energy than the 8.9 earthquake. 


Earthquake Energy: Suppose there were two earthquakes in the area, the first one had 
the Richter scale reading one unit higher than the second one. How much more 
powerful was the first earthquake than the second, in terms of energy released? 
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The Richter scale has a few disadvantages. One is that it was intended to 
measure local earthquakes (range of 100 km), and measurements become somewhat 
unreliable at large distances away. The other limitation is that one of the key measures, 
the timing of the P and S waves, assumes that the velocity of the waves is uniformly 
constant around, and in, the world. This is not the case — it varies with the type of rock/ 
sediment that the wave is passing through. There are also concerns that the strongest 
earthquakes are not measured as accurately as smaller quakes. The “Moment 
Magnitude” (Mw) measurement was developed to address more accurate 
measurements of larger earthquakes, and is being employed more frequently 
throughout the earthquake research community for those larger events. 


This scale is based on actual displacement measurements along faults at the 
location of the earthquake, instead of remote measurements of movement, like the 
Richter scale (although it can also be calculated from long period wave activity captured 
on seismograms). The Moment Magnitude calculation takes into account average 
displacement or movement along a fault, how far the rupture occurred (to calculate area 
displaced), and the ability of the rock to store strain before releasing (rigidity). The 
Moment Magnitude values are calibrated to Richter scale values, so that medium sized 
earthquakes (Mı ~5) will have the same Mı and Mw value. Larger and smaller 
earthquakes however, will often show more of a discrepancy between the two 
measurements, with the Richter scale being more appropriate for smaller quakes and 
the Moment Magnitude more appropriate for larger quakes. 


The important thing to remember about measuring an earthquake is that various 
scales that can be used. With both qualitative (intensity) and quantitative (magnitude) 
data, either measurement can take days and numerous observations until scientists can 
come to a consensus on what the values should be. In the quantitative data, 
uncertainties are generally in the order of +0.3 units. 


7.6. Hazards 


In Canada, about 3,500 quakes are detected every year, but of those, only about 
50 are noticed by people. The most obvious direct effect noticed by people is ground 
movement: at a minimum, it causes people to lose their balance, buildings to sway or 
shake, and items to rattle or fall off of shelves. This can be exaggerated in stronger 
earthquakes where walking is impossible and buildings collapse. Another direct effect 
is fissures or obvious ground displacement (road and fence offsets as seen in Figure 21). 
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The amount of direct destruction an area experiences from an earthquake 


depends on the: 


magnitude of the earthquake, 
distance to the epicenter, 


population density — the higher density areas 
will suffer greater damage, 


amount and strength of infrastructure - larger 
amounts of infrastructure, and that which has 
not been built specifically to withstand 
earthquakes, will increase the damage that 
occurs, and 


ground conditions — the sediment upon which a 
settlement is built plays a large role in how 
damaging an earthquake is. 


In areas built upon unconsolidated materials 
(glacial, river, lake deposits — i.e., not solid rock), and 
particularly where those sediments can become 
saturated with water, there is a high risk of 
liquefaction. Liquefaction describes the effect of the 
solid substrate (soil, sediments) turning into a liquid 


Figure 21. An example of 
horizontal and vertical 
displacement of a road after a 
magnitude 7.2 earthquake in Idaho 
in October 1983. Image Credit: G. 
Reagor, U.S. Geological Survey. 


when seismic vibrations shake the sediments and water together. The ground becomes 
incapable of supporting buildings; it can 


subside over large areas, creating sinkholes, 
and geysers of spewing mud, sand, and/or 
water. This was seen in the Christchurch, 
New Zealand earthquake from the initial 
quake in September 2010 to the strong 
aftershocks occurring in June, 2011, where 
bubbles of sand, silt and mud emerged from 
the ground, leaving tonnes of sludge for 
home and land owners to clean up (Figure 
22). Living on saturated deltaic sediments, 


and being situated next to an active 
Figure 22. One example of the liquefaction that subduction zone, liquefaction is the greatest 
occurred after the February 2011 New Zealand risk factor for residents of Vancouver. 


earthquake. (Photo credit: M. Luff) 
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Indirect effects of earthquakes can be more deadly than the actual earthquake 
itself. Most notably, landslides and tsunamis have caused hundreds of thousands of 
deaths. Landslides occur when sloped land loses stability, from liquefaction of a weak 
layer, from displacement of the supporting base, or a redistribution of mass from the 
shaking. As the land gives way and moves downslope, it can take away everything in 
its path, causing considerable damage. 


Tsunamis are large influxes of ocean water inland that occur when there has 
been a displacement of the ocean floor, either through underwater landslides or vertical 
movement along a fault. From a central point of origin, tsunamis radiate outward as a 
bulge of water — barely noticeable on the open ocean, and moving at tremendous speeds 
— up to 950 km/h. The wave speed is correlated to the water depth — the wave moves 
fastest in greater depths of water. When the wave approaches the shore where the ocean 
floor rises, the water slows down, but builds in height, finally reaching shore as a mass 
of water up to 30 m in height (Figure 23). 
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Figure 23. An illustration of how decreasing depth influences wave velocity, length and height. 


Tsunamis have frequently been referred to as tidal waves, but they have no association 
with tides. However, their approach is much like a low tide /high tide cycle to the 
extreme. The wave itself is generally not a giant curling wave, but a wall or mass of 
water that relentlessly moves inland, much like water at high tide, but with much more 
water, moving a greater distance inland, and much faster. A precursor to a tsunami is 
the recession of water from the shore, like a quick low tide. This phenomena is 
extremely dangerous to people who do not recognize it as an indicator of a tsunami and 
are drawn by curiosity out onto the newly exposed beach. Inadvertently, they put 
themselves in the worst position possible when the tsunami wave arrives. This was 
common during the December 26, 2004 tsunami that was triggered by an earthquake off 
the coast of Sumatra, Indonesia. 
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Knowing about precursors and warning signs can save lives, being unaware can 
put lives at risk. We now have a much better capability for forecasting tsunamis because 
of technology - a large number of seismic stations that detect earthquakes, buoys 
measuring sea heights, and communication networks that allow all of this information 
to be compiled and distributed appropriately. Since the devastation of the 2004 tsunami, 
the Indian Ocean now has a tsunami detection and warning system in place, similar to 
that which has existed in the Pacific Ocean, since an unexpected and deadly tsunami 
destroyed many villages in Hawaii in 1946. 


7.7. Predicting Earthquakes 


We can provide some warning for tsunamis, but what about what causes them? 
Can we predict earthquakes themselves? 


Just as in forecasting tsunamis, recognizing precursors is the key to providing 
warning of imminent earthquake activity. This is a short-term prediction — predicting 
when and where the risk is highest in a narrow window of time. Earthquake “warning 
signs” are diverse, and not consistent — hence the difficulty in prediction. These signs 
include increased strain in the rocks, changes in ground water levels, and movement 
along and adjacent to fault lines. Scientists measure these to look for patterns that may 
help predict future seismic events. There are also pseudoscience approaches to 
predicting earthquakes — looking to animal behaviour and cosmic planetary alignments 
for example. 


Long-term forecasts identify what areas are susceptible to earthquakes. The 
Theory of Plate Tectonics has helped enormously in our understanding of locations, 
timing, and magnitude of earthquakes. One key idea about plate tectonics is that the 
earth’s crust is in constant motion. Therefore, when one earthquake happens, it is 
simply relieving stresses that will begin to build again as the plates continue to try to 
move. This manifests in earthquakes happening periodically in the same area. Looking 
at historical records and geological evidence helps determine if a pattern of occurrence 
exists, which can lead to forecasting magnitude and recurrence intervals that can be 
expected for a certain area. Looking at the history and distribution of seismic activity 
can also help identify areas of seismic gaps — spatial breaks along plate boundaries that 
have not experienced an earthquake within a specific period of time. If adjacent areas 
have had seismic activity within the same time frame, these areas of inactivity may be 
considered “overdue” to move and adjust, and therefore most “at risk”. 
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Once it is determined what areas are most at risk for future earthquakes to occur, 
seismic hazard models are created to define risk areas. This information is then used in 
our day-to-day life to help with regional emergency planning and insurance estimates, 
and to define our local and national building codes, to provide us with the safest shelter 
possible. This allows residents to be aware and prepared if an earthquake does happen. 
The goal of the short-term prediction is to be able to warn people of an imminent quake 
to allow them to evacuate the area, while being consistently accurate and reliable in 
terms of location, magnitude, and timing. 


In California, earthquake probability maps are created for the whole state, to 
indicate the probability of a major (Mw 6.7) earthquake happening in the next 30 years 
in every area. There are also probabilities determined for individual faults. According to 
the Uniform California Earthquake Rupture Forecast (UCERF), the southern San 
Andreas Fault has a 59% probability of having one or more magnitude > 6.7 quakes in 
the next 30 years. Probabilities are generated from 4 sources of information, which are 
mathematically modeled in a geographic information system to generate probability 
values, and displayed in map form to be easily comprehended by residents. The four 
sources used to determine probability are as follows: 


geodesy: precise measurement of movements of the tectonic plates, 
structural geology: offsets and locations of faults, 


seismology: seismic characteristics of previous historical earthquakes, location of 
epicenters, strength, 


paleoseismology: information on past (unmeasured) seismic activity in the area 
— when earthquakes happened, how strong they were. 


One of the primary goals of earthquake research is to be able to reliably predict 
events. Although we are not at the point of being able to predict events, we can make 
estimates of the probability of earthquakes, given certain circumstances. 


According to the Southern California Earthquake Data Center, the probabilities 
of an aftershock occurring in the same area (which is one unit of magnitude smaller) 
following a major quake are as follows: 


within 1 day: 28% 
within 7 days: 40% 
after 7 days, but within 1 year: 26% 


Based on these data we can answer the following questions. 
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Aftershocks in the first week: Find the probability of an aftershock occurring within the 
first 7 days but not during the first day. 


Solution: Knowing that we have a 40% probability of an aftershock in the first seven 
days and a 28% probability within the first day, we can calculate: 


P = 0.40 — 0.28 
= 0.12. 
Therefore we have a 12% probability of an aftershock in the week following the original 
earthquake, but not including the first day. 


Aftershocks in the first year: Find the probability of an aftershock as above occurring 
within the first year. 


Solution: To calculate this probability we need to determine the probability of the 
aftershock occurring within the first week OR after seven days but within one year. Note 
we are not combining these probabilities; we are determining the probability that either 
of the two events occur. 


P(A or B) = P(A) + P(B) 
P = 0.40 + 0.26 
= 0.66 


Therefore we have a 66% probability of an aftershock in the next year. 


Aftershocks in the first week but not the first day: Given that no aftershock has 


occurred during the first day, find the probability that we will get an aftershock within 
the next 6 days. 


Solution: To solve this question, we need to find the conditional probability. Conditional 
probabilities are used when the outcome of one event (like an aftershock within the first 
day) affects the probability of the other event (such as the probability of an aftershock 
within the rest of the week). 


We know the probability of an aftershock during the first day (28%), so the probability 
of NO aftershock during the first day is given by: 


P = (1 -not P) 
=1 -0.28 
= 0. 72, or 72%. 
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Let P denote the conditional probability that we will get an aftershock in the next 6 days 
given that none has occurred during the first day. We must have: 


(1 — 0.28) P = 0.4 -0.28 
0.72 P = 0.12 
P = 0.17 or 17% 


since the probability of an aftershock occurring within the first 7 days and not during 
the first day equals the product of the probability of no aftershock during the first day 
times the conditional probability of an aftershock occurring within the next 6 days 
given none has occurred during the first day. Therefore we can say that the probability 
of an aftershock occurring with the next 6 days given that no quake has occurred during 
the first day is about 17%. 


Probability of Repeat Earthquakes: Once an earthquake has happened, the probabilities 


of having a larger earthquake occur in the same location within a given timeframe are: 


within one hour: 1.5% 
within three days: 6%. 


Find the following conditional probability: Given that no larger earthquake has 


occurred in the same area with the first hour, find the probability that we will get a 
larger earthquake in the same area within the next 2 days and 23 hours. 


7.8. Who’s at Fault? 


The ability to predict also comes with expectations. In April 2009, a large 
earthquake hit Aquila, Italy and 309 people were killed. Seven members of a committee, 
including six scientists and an official from the Civil Protection Agency were 
responsible for assessing the risk to the population of a recent increase in seismic 
activity. The scientists felt that the activity was actually releasing strain slowly and 
methodically, so that a large quake was not imminent, and citizens were not in danger. 
The government official’s message to the population one week prior to the quake was 
one of reassurance that they were indeed safe, based on the scientists’ information and 
predictions. Members of this committee have been indicted for manslaughter, with the 
rationale that citizens who intended to leave were encouraged to stay because of the 
message, and subsequently lost loved ones, or their own lives. Did the scientists truly 
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believe there was no risk, when there was substantive seismic activity ongoing? Or did 
they conclude that they simply couldn’t predict what else might happen, and therefore 
couldn’t assure that there would be a large quake? It brings to bear the question of 
responsibility and the tension between science and policy, and people’s expectations of 
science. 


"Nobody here wants to put science in the dock. We all know that the earthquake 
could not be predicted, and that evacuation was not an option. All we wanted was 
clearer information on risks in order to make our choices” said Vincenzo Vittorini, 
President of the local victims' association '309 Martiri' (Nosengo, 2011). 


In California, maps of the probabilities of state-wide earthquakes and individual 
faults are reviewed by individual scientific panels, as well as national and state 
Earthquake Prediction Evaluation Councils. They do claim there is some uncertainty, 
but perhaps the best message they voice is that all Californians should be prepared for a 
major earthquake. 


It might be easy to say that nothing good comes of earthquakes. But in fact, they 
have helped us understand the structure of the interior of the Earth, which has led to a 
greater understanding of our magnetic field, which shields us from deadly charged 
particles of the solar wind, enables our magnetic compasses that has enabled 
exploration and mapping, and is thought to be the mechanism by which birds and 
insects such as butterflies sense their place in the world when they migrate. Our 
observations of earthquake-generated waves have also fostered seismic studies and 
research of the crust, which supports oil and gas exploration. 


Remember that when an earthquake happens, different types of seismic waves 
are generated. The body waves, P and S, travel through the Earth’s interior, but the 
characteristics of their particular wave forms make them interact differently within 
different materials. If the Earth’s interior were completely homogeneous, the waves 
would move uniformly through the Earth, in a consistent direction and with a constant 
speed. Neither of these is true however. Most notably, S waves cannot move through 
liquid. Scientists observed that on the opposite side of the Earth from where an 
earthquake occurs, no S waves appear. This S wave “shadow-zone” was determined to 
be caused by a liquid-core — a medium that an S wave with its two directional 
movements (forward and perpendicular) cannot propagate through. The idea of the 
liquid core also explained why the P waves seemed to bend on their way through the 
Earth (refraction), and create a smaller shadow zone, and how a magnetic field could be 
generated (highly conductive molten iron flowing in the core). 
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By studying the propagation of seismic waves, and increasing our understanding 
of how various materials affect wave movement at different pressures and 
temperatures, scientists have been able to model the interior of the Earth (Figure 24). 

We have a sense of the “layers” of the Earth, the characteristics of each, and the 
boundaries between them. The Earth’s interior consists of: 


Crust 
(low density rock 


7-70 km thick) 


Figure 24. A cross-section of the interior of the Earth. The layers of the crust, mantle and core are based on the 
composition of the Earth while the lithosphere, asthenosphere, mesosphere and the outer and inner core are 
distinguished by physical properties. 


Crust: Mantle boundary (depth of 50 km below the surface): Called the Moho 
(after its discoverer, Andrija Mohorovičić) and recognized because the P-waves 
moved with different velocities above and below this boundary, due to the 
differing composition of crustal vs. mantle rock. 


Core-Mantle Boundary (depth of 2900 km): Studies of seismic waves revealed 
that at this depth, the P-waves changed direction slightly (due to refraction) and 
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slowed down, and the S-waves stopped moving entirely. The idea that the waves 
had encountered a liquid layer was the best explanation for these observations. 


Inner core: The liquid layer is referred to as the outer core, because yet another 
layer was found at even greater depth. At the true centre of the Earth, lies a solid 
inner core, with a radius of approximately 1216 km. This was distinguished from 
the outer core because of a change in velocity of the P-waves again, and a 
detection of the boundary by seismic studies of underground nuclear testing. 


As technology improves and our ability to measure seismic vibrations is 
enhanced, our knowledge of the Earth’s interior will also be enhanced, leading to 
greater opportunities for learning and making sense of the world. Earthquakes aren’t 
just something to learn about, we can learn from them as well. 


98 


Chapter 8. Volcanoes 


Earthquakes are evidence of a dynamic Earth — one that moves and changes in 
ways that are both predictable and not. Volcanoes are evidence of this as well, and not 
surprisingly, are also both predictable and not. Though we live with the understanding 
that rocks are solid, seemingly eternal, and the foundation of our physical world, 
volcanoes remind us that below our feet, rock can melt and move with explosive force. 


Volcanoes are vents that allow movement of material from below the Earth’s 
crust to the surface. They exist in many different stages of evolution. For instance, 
“active” volcanoes are those that have erupted in recorded history (eg. Mount St. 
Helens). “Dormant” volcanoes have not erupted in recorded history, but may again (eg. 
Yellowstone). Finally there are those volcanoes that we deem to be “extinct”, because 
we believe they have little or no chance of erupting again (eg. Mt Kilimanjaro). 


The term “volcano” comes from the ancient Roman God of Fire, Vulcan. Vulcan 
was recognized for his powers of both destruction and fertility, a reflection of our dual 
relationship with fire. Similarly, volcanoes are both hazardous and beneficial to 
humanity. In addition to providing new and fertile parent material from which rich soil 
is derived, volcanoes also create a relatively cooler and moister environment in which to 
live — a haven for those in hot areas. The slopes act as an impediment to air flow, 
causing air to rise and cool, which lessens its ability to hold water, resulting in 
condensation (clouds) and precipitation (rain / snow). In many cases, volcanoes are 
associated with productive, fertile growing areas, and the availability of fresh water. 
This is important to remember when we consider why large population centres are 
located alongside potentially dangerous volcanoes, such as the once thriving metropolis 
of Pompeii, Italy (Mt. Vesuvius). Mexico City’s 20 million residents are knowledgeable 
about their neighboring volcano — its name “Popocatepetl” means “smoking mountain” 
in the native Nahuatl language of the original Mexicans of the region. However, Mexico 
City is situated just far enough away from the volcano (65 km) to be considered fairly 
safe for its citizens. In 2001 when Popocatepetl erupted, 70,000 people from its slopes 
were evacuated, but the winds blew the damaging ash and smoke away from Mexico 
City, so day-to-day activities in the city were not dramatically affected. However, there 
are always risks for communities adjacent to volcanoes. 
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8.1. Understanding Risk 


So what constitutes risk? Scientists who study volcanic hazards look at 
population densities and their spatial distribution with respect to proximity to 
potentially active volcanoes, of which there are between 1500 and 2000 in the world. A 
Volcanic Population Index (VPI) is an objective measure of risk to human health by 
volcanoes. The greater the population numbers living close to a volcano, this higher the 
index value, which indicates a higher risk. Volcanoes erupting in non-populated areas 
have a low risk, as they are not considered to be directly hazardous to humans’ health, 
though effects such as aerosols, dust, and ash can still be felt in more distant locations. 
VPI; constitutes the population living within 5 km of the vent, which would have to be 
evacuated should a volcano occur. VPlio considers the population within 10km of the 
vent, which would be directly affected by the volcanic activity, though acute hazards 
would be inconsistently felt. The residents of Pompeii were located almost 10 km from 
Mt. Vesuvius, yet experienced immense heat and deadly ash falls at that distance. 


In Central America alone, approximately 5 million people live within a 10 km 
risk zone of Holocene (recent geologic history, less than 10,000 years) volcanoes, and 
half of that population is in the risk zone of historically active volcanoes. One third of 
El Salvador’s population lives within a 10 km zone of a potentially active volcano. 
Residents of Japan, Italy, Iceland, Indonesia, Ecuador, Philippines, Chile, New Zealand, 
and the United States are just a few of the world’s population for whom volcanic 
hazards shape everyday life. 


8.2. Hazards 


Hazards are typically considered only “hazardous” if they interfere with, or 
affect people. In this sense, what people do, and the situations we create, define what 
hazards we face. Consider air travel. Within the last 100 years, we have built a global 
transportation network in the skies, a system people have come to depend on both in 
safety and consistency. Aircraft engines however, are extremely susceptible to damage 
by volcanic ash. Volcanic ash has the potential to travel immense distances in the upper 
atmosphere, and can affect aircraft (and therefore people and places) that were nowhere 
near the original eruption. The Puyehue volcano that erupted on June 4*, 2011, in Chile, 
caused the disruption of air travel not only in neighbouring Argentina, Uruguay, and 
Brazil but by June 11, it also affected Australia and New Zealand, as the ash cloud 
moved west across the Pacific Ocean. 


100 


Air travel across Europe was severely interrupted in 2010 when the 
Eyjafjallajokull volcano in Iceland erupted, sending ash high into the atmosphere. The 
hazard in this case was enhanced by the fine consistency of the ash, which made it light 
enough to stay airborne, and the prevailing winds which blew the ash right over the 
flight paths of Europe’s busiest airports. Approximately 10 million travelers were 
affected by the most extensive closure of European airspace since World War II. 


In 2011, another eruption of an Icelandic volcano had many people worried 
about a similar situation. The ash from this eruption was coarser though, so did not 
remain airborne as long. In addition, following the Eyjafjallajokull eruption, many 
questioned whether the flight ban was required, and many tests were conducted to 
understand the risk of volcanic ash to engines, and to provide guidelines for safe flight 
conditions. Before the Eyjafjallajokull event, ash was considered to be too damaging to 
fly through; the glass particles within it too abrasive and damaging for the engines. 
Planes had to give ash plumes a wide berth, and the inconvenience of having a longer 
and costlier flight was an inconvenience that could be tolerated occasionally. However, 
the ash cloud affecting many of Europe’s busiest airports was unprecedented. With 
millions of passengers affected and a huge financial consequence, travelers and 
businesses both wondered about the risk. Was there an ash-content threshold that 
could divide safe flying conditions from unsafe? 


Studies since that time have indicated that the ash was particularly damaging, so 
the flight ban may have been appropriate, but new regulations allow for more flexibility 
in choosing whether to fly or not. Improved sensitivity of ash measuring instruments, 
testing of individual ash clouds and ensuring pilots are given that information as part of 
their meteorological updates, along with regulations ensuring detailed equipment 
inspections after every flight are all now components of the flight risk-assessment plan — 
a scientific way of evaluating and balancing the demands of people against a natural 
hazard. This is a balancing act that tends to shift back and forth depending on our 
complacency or fear at the time. This highlights a constant tension between science and 
policy, between scientists and those with a business, political, or even emotional 
agenda. When our safety is taken for granted or assumed, our tolerance for risk 
becomes much greater. In the realm of Earth Science, this is where literacy is critically 
important — to be able to objectively see and assess hazards and risks, and to advise 
appropriately. 


8.2.a. The Case of St. Pierre 


The beautiful town of St. Pierre was located at the base of a volcano, like many 
others towns in the Caribbean Islands. The volcano, Mt. Pelée had not erupted for two 
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hundred years, but when signs occurred that it might erupt again, the citizens ignored 
the warning signs. It was a complacency that would cost 30,000 lives. Prior to its 
eruption in the morning of May 8t, 1902, many clues should have indicated that the 
risk to the town was increasing. In February, sulphurous gas was noticeable. In April, 
steam was seen coming from the vents high up the volcano slopes. In later April the 
ground shook from tremors. Ash actually started falling on the town on April 25 and 
26th, as the crater erupted, and a lake of boiling water soon developed. Sulphurous gas 
and ash were plentiful enough to cause residents to avoid being outside, or to cover 
their faces in order to breathe. Yet, on May 24, an article in the paper appeared to 
remind citizens of the upcoming picnic to be held on the volcano. There was little in the 
way of official warning of the volcano’s potential for destruction. One suggested reason 
was that there were important elections to take place on May 11, and the politicians 
didn’t want people leaving the island before the vote could take place. Even a captain 
who piloted his cargo ship away from the island when it was only half full (because he 
did not like the volcanic activity he was observing) was threatened with legal action by 
the port authorities. In any case, on May 8*h, the mountain erupted with a blast of 
poisonous, superheated air that literally obliterated stone buildings, knocked ships over 
out at sea, and threw cannons and statues off their foundations. The entire population, 
save two people, were killed by the blast, the heat, and the poisonous gases in the 
“glowing cloud” (nuée ardente) that issued from Mt. Pelée. St. Pierre became a studying 
and research ground for volcanic processes and an example of the need to provide risk 
education and appropriate warning to citizens living in the shadow of a volcano. 


8.2.b. The Case of Mount Pinatubo 


Mount Pinatubo is situated 90 km from the capital city of the Philippines, 
Manila. About 30,000 people lived along its flanks, and a US military base — Clark Air 
Base — was home to 18,000 personnel and their families. The eruption that occurred on 
June 15, 1991 was globally the second largest of the century. The magnitude of the 
eruption and the number of people living in close proximity could have resulted in a 
massive death toll. However, scientists and the government were attentive to the signs 
of the imminent eruption, and action was taken to lessen the consequences of Mount 
Pinatubo’s eruption. Warning signs began almost a year prior to the eruption, in the 
form of a major earthquake, followed months later by multiple small earthquakes near 
the volcano. Two months before the actual eruption, explosions like the small eruptions 
at Mt Pelée resulted in ash covering local towns. This triggered the first set of 
evacuations — 5,000 people. With continued activity, an evacuation area of 20 km around 
the volcano resulted in another 25,000 people being moved from the assessed hazard 
area. Ultimately 58,000 people from a radius of 30 km from the volcano, and the 
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American military base, were evacuated. Three days after this evacuation was 
implemented, Mount Pinatubo erupted, releasing immense amounts of ash and sulphur 
dioxide gas, which ended up influencing global weather for 2 years, and leaving a 
heavy ash layer over 2000 km2. Approximately 800 people were killed, and property 
damage was extensive. The upheaval in terms of displaced people and economic effects 
were enormous, but could have been much worse without the attention to the volcano 
and advising people appropriately. 


8.3. Nature of Eruptions 


Besides having people in close proximity to a volcano, other risk factors include 
the volcano itself. What type of eruption will it have? Will it “flow” or “blow”? What 
will be the main hazards that the population will need to protect themselves from? 
Understanding volcanic processes, origin of the magma and resulting volcanic 
products, and the geologic situation of the volcano helps us to understand these other 
risk factors, and appropriately mitigate if possible. 


Volcanoes eruptions can either “blow” - erupt violently, like Mt Pelée , or “flow” 
like the Hawaiian volcanoes. Compare the experiences of residents living close to 
Mount Merapi, Indonesia and Mount Kilauea, in Hawaii, USA. Mount Merapi, the “fire 
mountain” is the most active volcano in Indonesia, and has been active since the 
mid-1500s. Knowing that, thousands still live on its flanks. In 2010, eruptions caused 
350,000 people to be evacuated, and hundreds were still killed by hot debris flows, 
falling rock debris and the ash cloud that was generated from this explosive volcano. 
Lava flowed into the rivers, rains mixed with ash to create massive mud flows, and 
thick clouds of toxic gases caused extensive respiratory problems to thousands of 
people. 


On the Big Island of Hawaii, another volcano was erupting, but the scenario was 
very different. Mount Kilauea’s destructive power is slower moving, but still insidious. 
Kilauea has been erupting almost continuously since 1983. Like Mount Merapi, people 
live on the flanks as well, despite the dangers associated with it. In the summer of 2010, 
residents lost homes, but were able to stand beside and watch as lava slowly crept 
toward their home and first ignited, then engulfed them. The eruption and hazards of 
Kilauea have a different scale and pace, though are as uncontrollable. Blocky lava 
flowing toward the ill-fated homes moved at pace of around 77m/day. Slow enough to 
out-walk, but still unavoidable. 
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What makes the eruptions of these two volcanoes so different? It all relates to the 
type of magma (molten rock) that is creating the volcano, which ultimately relates to 
where the volcano is located, especially in relation to the tectonic plates. 


Magma is the molten material that moves through the volcano to create lava. 
Understanding magma composition and how it affects volcanic eruptions, and risk to 
settlements, is much like understanding the pathogen responsible for a disease. When 
the source is understood, the treatments and precautions that can be taken are also 
better understood. Whether an eruption is explosive or not, is highly dependent on the 
viscosity of the magma. Viscosity refers to the resistance to flow — highly viscous 
materials move slowly like peanut butter, whereas less viscous materials can flow like 
syrup. However, how viscosity affect eruptions might seem counterintuitive. It is the 
viscous, thick magma that creates the explosive event. 


What determines magma viscosity is primarily chemical content and 
temperature. The key chemical is silica — the higher the silica content, the stronger the 
bonds within the magma, and the higher the viscosity. The more explosive volcanoes 
are associated with continental crust, as it has higher silica content than oceanic crust. 
Temperature has the opposite relationship — the higher the temperature, the less viscous 
the magma. Water content also plays a role — increased water decreases the melting 
point of minerals in the magma, and causes the eruption to be more explosive. With 
increased water content comes an increase in gas content (bubbles) that explode as 
pressure on the magma lessens as it rises closer to the Earth’s surface. 


Gasses are trying to make their way to the surface, and when the magma is 
runny, it is not a problem for gas bubbles to escape (think of bubbles in a pot of boiling 
water). Now picture a pot of thick, viscous spaghetti sauce, and what happens when 
bubbles try to escape as it boils? There are miniature explosions and the spaghetti sauce 
erupts right out of the pot, flies through the air and makes a mess! 


Before an eruption, magma moves up from the magma chamber (like a holding 
tank under the volcano), and as it moves up the volcanic vent, the pressure is released, 
and the gasses rise to the surface (like opening a bottle of pop). If the gas can easily 
escape through the magma (ie. the magma is not viscous), the gas will exit, and may 
make a nice, but fairly harmless show of lava fountains, but not much explosive action 
actually occurs. This is the case of the Hawaiian Islands. 


When the magma is viscous, explosive eruptions occur. The volcanoes that have 
affected the Calgary area most recently have been explosive eruptions — Mount St 
Helens in Washington State erupted in 1980, and fine ash landed throughout southern 
Alberta. There is also a layer of ash in the local soils known as Mazama ash, which 
creates an important time marker in soil layers and geologic profiles. It indicates the 
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date of 6,800 years ago, when Mount Mazama located in Oregon erupted, leaving 
nothing but a large crater behind. Today, Oregon’s Crater Lake, the remnant of Mt. 


Mazama is a popular tourist destination. 


8.4. Locations of Volcanoes 


The factors that determine the eruption characteristics, and the actual 


distribution of volcanoes, are typically tied to plate tectonics. A comparison of global 
locations of volcanoes with plate boundaries and earthquakes reveals a definite spatial 
relationship between all three. Continental volcanoes tend to be aligned with converging 


boundaries of tectonic plates, such as the Andes Mountains along the west coast of 
South America, and the Cascade Mountains running parallel to the Pacific Coast in 
Northern California, Oregon, and Washington. The explosive volcanoes in Alaska, 

Indonesia, and Central America are similarly situated alongside subduction zones, 
where two converging plates cause one to be “subducted” under the other. When a 


plate subducts, it descends beneath the other, and begins to angle down into the mantle. 
The heat found at depths of around 100 km releases volatiles such as water from the 
subducting crust. This lowers the melting point of overlying rocks, which begin to melt, 
creating molten magma. This magma begins to rise through the mantle because of its 
lighter density, and creates a conduit to the surface, which gives rise to what we know 
as a volcano. Volcanoes produced in this manner typically form a series of volcanoes, 


aligned parallel and equidistant to the trench. 


Three types of convergent boundaries exist: 


Oceanic-continental: When oceanic 
crust meets continental crust, the higher- 
density oceanic crust subducts below the less 
dense continental crust. Similar to the 
oceanic-oceanic boundary, the geographic 
situation is one of a deep trench where the 
two plates meet, and volcanoes occur parallel 
and equidistant to the trench (Figure 25). This 
time, they form on land, like the Cascade 
Range along the coast of the northwest 
United States, where the Juan de Fuca plate is 
subducting under the westward moving 
North American plate. This type of volcanic 
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Figure 25. An example of an oceanic- 
continental convergence and the resulting 
continental volcanic arc. 
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range can also be termed a “continental volcanic arc”. Volcanoes along the Ring of Fire 
(either continental or island based) are volcanic arcs caused by the subduction of an 
oceanic plate. 


Oceanic-oceanic: In this zone, a ict aca 
oceanic crust pushes against oceanic crust, ier -* 
and one of the layers descends below the <> Oceanic 
other (subduction). A deep trench occurs lll 
where the plates meet, and approximately 

300 km beyond the trench, on the side of 

the overlying crust, a series of volcanic 
islands appear (Figure 26). The volcanoes 
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Figure 26. At the convergence of two oceanic 


which include J apan and the Aleutian plates, it is common for volcanic islands to form. 


Islands. 


Continental-continental: When 
continents collide, the result is compression and buckling and orogeny — mountain 
building (Figure 27). The Himalayas are an example of a recent continental-continental 
collision between India and Asia. Older 
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layer of the earth’s crust spreads and thins, 
magma rises from the mantle and 


, mee Figure 27. Convergence of two continental plates 
extrudes, trying to fill in the space. results in mountain building or orogeny. Creative 
Commons (2012). 


Two types of divergent boundaries exist: 


Continental: fundamental to the theory of plate tectonics and the breakup of 
Pangaea, is the idea that continents can move together but then split up again. The split 
refers to a continental divergence, whereby a rift or break forms in the land, and the 
land mass begins to separate — much like what has occurred in the Red Sea in the 
Middle East, and the Rift Valley in Africa. Volcanoes occur as magma wells up in the 
thinning crustal area (such as Mt. Kenya and Mt. Kilimanjaro). 


Oceanic: sea floor spreading, as postulated by Harry Hess, whereby a mid- 
oceanic ridge is the site of ocean floor creation, relies on the evidence of a divergent 


106 


oceanic boundary to hold weight. His theory is well accepted, not only because of 
ancillary evidence (magnetic-striping indicating a moving crust) but also because of 
features like the Mid-Atlantic Ridge, where fissures and volcanoes bring magma to the 
surface and new crust is formed (Figure 28). While most of the divergent oceanic 
boundary activity remains below sea level, Iceland, situated on the Mid-Atlantic Ridge, 
is an example of divergent boundary volcanoes building up land above the surface. 
The Mid-Atlantic Ridge, located in the middle of the Atlantic Ocean, is the world’s 
largest mountain range. 
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Figure 28. Rising magma creates a rift zone as the oceanic plates move apart. This process is responsible 
for creating such features as the mid-Atlantic Ridge in the Atlantic Ocean. Creative Commons (2012). 


Volcanoes can also occur in places unrelated to tectonic boundaries. Intra- 
plate volcanoes refer to those that appear in the middle of a plate, generated by 
“hotspots” in the mantle. As a plate moves over a hotspot, the crust “blisters” and lava 
is expelled onto the surface, creating a volcano. The Hawaiian Islands are a chain of 
such “blisters”, describing the movement of the Pacific Plate over a hotspot. The islands 
are successively younger from northwest to southeast, where the youngest, Big Island 
of Hawaii is still active (Figure 29). Connected to the Hawaiian Islands is the Emperor 
Seamount chain that consists over 80 underwater volcanoes that were also formed from 
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this same process. This volcanic range stretches over 5800 km and includes the visible 
Hawaiian Islands. 
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Figure 29. The location of the Hawaiian Islands. Hawai’i is the youngest island while the northwest 
islands are approximately 30 million years old. This chain of islands was formed as the Pacific Plate 
moved across a volcanic hotspot. Creative Commons (2012). 


8.5. Composite Volcanoes 


Convergent boundary volcanoes are characterized by their explosive nature — 
especially those that are created through the oceanic-continental interaction. The 
continental crust from which the volcanoes are derived are high in silica, and the water 
from the subducting oceanic crust gives rise to a gaseous, explosive phenomenon. The 
volcano grows from a mere conduit, or pipe, in the ground, accumulating successive 
layers of ejecta and lava until it forms a very symmetrical, pyramid shaped appearance, 
characteristic of volcanoes like Mount Fuji, Mt. Shasta, and Mt Etna to name a few. 


These volcanoes are called “stratovolcanoes” or “composite volcano” structures. 
The size of material ejected along the slopes helps define this shape. Coarse material 
that settles close to the vent has a steep angle of repose, and therefore keeps the slopes 
steep, while finer material further away can only support a gentler slope. At the top, a 
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depression, the crater, is located at the opening of the vent. The crater forms as material 
is ejected a short distance out, continuously building steep walls around the vent. 
Although lava issues forth from the central summit vent, it can also emerge through 
vents that occur on the flanks. These vents can remain fairly harmless, allowing only 
gas to escape (in which case they are called “fumaroles”), or they may become 
secondary vents, building another cone on the original (“parasitic cone”). 


Clouds of toxic gas and ash, lava, pyroclastic bombs (rock fragments), glowing 
avalanches (nuée ardente) like the cloud that killed the residents of St. Pierre, and lahars 
(mud-debris flows) are all products of explosive eruptions that are hazardous to people. 


Most characteristic of the explosive volcanoes is the plume of ash and debris that 
explodes from the volcano. This pyroclastic (pyro — fire, and clastic — fragments) material 
ranges in size from powder to boulders, and consists of pulverized rock, lava and glass. 
It is the glassy bits that make ash so gritty and dangerous to aircraft. 


The very fine ash can travel long distances in the atmosphere, and settle out far 
from the original eruption site. Ash deposits are common in soil layers around the 
world. Coarser materials that act more as projectiles are named based on their size. 
Lapilli (little stones) or cinders refer to material up to about 6cm in diameter. Larger 
lava “stones” that are thrown out are called “blocks” or “bombs” depending on if they 
have hardened already or not. Bombs are still semi-molten, and can take on a 
streamlined appearance as they finish cooling in the air. Bombs the size of 6-m long and 
200 tonnes have been recorded 600 m from the volcanic vent. These are some of the 
more direct hazards from an explosive volcano. Volcanic rock that contains a lot of 
bubbles, or vesicles, is called pumice. The bubbles make these rocks surprisingly light, 
sometimes even buoyant. 


Flows of pyroclastic material can be some of the most deadly effects of a volcano. 
Hot gases mixed with ash and coarser materials can create a “glowing avalanche” that 
can travel exceptionally quickly and far (up to 100km) because of the buoyancy 
provided by the gases, and steep slope of the composite volcano. Referred to as a Nuée 
ardente, this flow can destroy everything in its path in seconds, as occurred in St Pierre, 
Martinique. 


When the debris from volcanic eruptions is mixed with water, deadly flooding 
can result. Mudflows called lahars can be triggered by glacier or snow melt during an 
eruption, or by extensive rainfall saturating volcanic debris. Quick downslope 
movement of these flows is facilitated by the steep sides of a composite volcano. Most 
of the destruction from Mount St Helen’s was due to lahars, and these volcanic induced 
mudflows are responsible for thousands of deaths around the world. 
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8.6. Shield Volcanoes 


Volcanoes located in divergent zones and intraplate zones tend to be less 
explosive than in convergent zones. The oceanic crust from which they are derived 
contains less silica, and the lava is more fluid, less viscous. The volcanic structures that 
form from this fluid lava are very broad, gently sloped “shield” volcanoes. These can be 
massive structures — Mauna Loa, which is part of the island of Hawaii, is actually larger 
than Mount Everest: 9 kms high from its base on the sea floor to its summit. 


The products from these volcanoes are primarily extensive lava flows. In Hawaii, 
this takes the form of lava tubes and ropey, slow moving lobes of lava called pahoehoe, 
and blocky aa lava, which looks like a wall of rubble moving across the landscape. On 
the most recognizable products are pillow lavas, round lava deposits created 
underwater from successive cooling and extrusion of lava. 


For centuries, there was a mystery was to why volcanoes behaved differently 
around the world. It was difficult to reconcile the gentleness of the Hawaiian volcanoes 
such as Mauna Loa with the violence of a Mt. Pelée. No explanation really made sense 
until the theory of plate tectonics was put forward. Plate tectonics seemed to make 
sense of the volcanic questions and volcanic evidence (distribution, different types of 
eruptions) provided further evidence for the theory. 


8.7. Indirect Hazards 


Besides the direct hazards that were ascribed to each type of volcano, there are more 
indirect hazards can affect humans and their long-term survival in an area. 
° Earthquakes, earth tremors: many usually precede an eruption 
° Tsunamis can be caused by underwater volcanic activity or underwater 
landslides due the changing shape of a volcano or deposition of materials into 
the water. 
Contamination or elimination of clean water supplies through toxic materials, 
heat, or blast. 
° Poisoning and destruction of the vegetation including food crops, through acid 
rain, and toxic gasses. 
° Physical damage to crops, landscape, infrastructure from initial eruption. 
* Loss of home, shelter, fresh water and health. 


After Mount Tambora erupted in Indonesia in 1815, approximately 92,000 people 
died — 12,000 from the initial volcanic activity, and 80,000 from starvation during the 
aftermath of the event, after all the vegetation on the island was destroyed. Iceland has 
faced similar tragedy — in 1783 one quarter of its population died from starvation after 
poisonous gases from the eruption of Laki killed off much of the livestock. More 
recently, the 2004 Indian Ocean earthquake and tsunami took the lives of almost 230,000 
people with many more people injured or displaced. This earthquake and the resulting 
tsunami is now considered one of ten worst earthquakes in recorded history with over 
1.6 million people displaced from their homes. 


Natural disasters are often a springboard into humanitarian crises, since 
sanitation, food and shelter are not often readily available, creating an opportune 
situation for the outbreak and spread of disease. In a meeting of scientists who study 
the confluence of dangerous volcanoes and human population, they have concluded 
that not only does hazard mapping and monitoring have to continue, but awareness 
and education of the community is critical to prevent large scale disasters from volcanic 
events. 
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Chapter 9. Floods 


The great conundrums of life often show up in water. If we view water as both a 
source of wealth and worry, we are often reflecting upon the vast power inherent in 
floods. 


Of all major natural disasters such as earthquakes, tsunamis and cyclones, each 
have been eclipsed by the devastation of floods. While over 250,000 people were killed 
in the historic 2004 Indonesian earthquake and tsunami, and again over 250,000 people 


killed in the Haitian earthquake of 2010, they both pale in comparison to flood disasters. 


In 1931, the Yellow River in northern China for example, left behind a watery deluge 
killing up to 4 million people. Surprisingly, it was not a one off, chance occurrence for 
the Yellow River. This river and others have taken millions of lives throughout their 
history with humans. Unfortunately, devastating floods often occur due to a series of 
incremental actions induced naturally and by humans that lead to an inevitable and 
often difficult to prevent disaster. 


The great conundrum is that the potential devastation of floods sits in contrast to 
the extreme fertility and productivity of flood plains. Wherever rivers exist, people 
gravitate to their edges where farming and economic activities are most lucrative. Yet 
farming brings an increase in food production. An increase in food production leads to 
an increase in population. Soon, intensive farming and urban activity becomes dense 
and inflexible. The stakes rise as it becomes increasingly important to prevent flooding. 
By this point, an almost inevitable feedback loop has been set in motion and the 
reaction across time and space has often been the same. A fear of flooding leads to 
actions that place restrictions on the river to make sure it does not spill over its banks 
and destroy the now highly developed floodplain. Dykes and levees are built; rivers are 
dredged and rerouted. Vast floodway overspill schemes are built. 


Rivers however, are in a constant state of flux between erosion and deposition, 
and ultimately, relocation. As rivers run from their headwaters to their mouths, the 
variables of velocity, width and depth determine where the total discharge of the river 
will deposit sediment, or erode its banks. Attempts to confine rivers either behind dams 
or with levees inevitably results in sediment filling in behind the dam or on the bottom 
of the river. In central China for example, the Yangtze River has been continuously 
depositing its high sediment loads on the bottom of the streambed, resulting in a rising 
base level. To keep the river in its channel, levees must constantly be raised in response 
to the ever-rising stream bed. This cat and mouse response has meant that in some 
places around the major urban centre of Wuhan, the bottom of the Yangtze River is now 
10 metres above the floodplain. Any break in a levee or dam has been devastating. To 
attempt even further control over the river, construction of the world’s largest dam, the 
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Three Gorges, was the response in 2008. The irony is that floodplain fertility is 
dependent upon the silt of floods and floods are at the same time, one of the most 
devastating impacts on farmers operating under extremely fickle circumstances. 


While China often experiences extreme examples of flood dynamics, the same 
process plays itself out the world over, including North America. The Mississippi River 
for example has been plagued by the conundrum of fertile floodplains being attractive 
to farmers and navigable portions that are lucrative for shipping. Key urban centres 
such as New Orleans have become dependent upon the river remaining in its current 
location. Yet by a natural process known as delta shifting, the Mississippi, like most 
rivers, has changed the location of its mouth on numerous occasions. The shift can be 
dramatic with the rivers new location being often tens or even hundreds of kilometres 
away, effectively negating the current port cities as irrelevant. This occurs as sediment 
builds up in the slow moving waters of the river mouth, eventually filling it in. As this 
happens the river finds a new and steeper route to its exit. Consequently, great pressure 
in the form of levee construction and maintenance is exerted to maintain the current 
course of the river. Every action however, leads to an increase in the likelihood of 
flooding. 


Any alteration to a river can lead to unknown consequences, but what we do 
know is that no matter what we do, a river will always, at some point, flood. 


9.1. Why Do Rivers Flood? 


Flooding occurs whenever the capacity of the stream channel is exceeded. This 
typically occurs in acute phases during spring runoff as excess water and snowmelt in 
the drainage basin fills the downstream portions and overflows onto the surrounding 
floodplain. It can also occur as flashfloods during short duration intense rainfall where 
the ground surface flow is faster than the ability of the soil to absorb the water. The 
opposite of this is prolonged floods where unusually heavy precipitation occurs over 
an extended period of time, exceeding soil saturation capacity. The result is a slow but 
persistent increase in water levels. Of course, floods do not always occur in every year 
on each river or stream. 


Chronic flooding events will intensify each year if the capacity of a stream 
channel decreases in relation to runoff. This can happen through such things as 
sediment accumulation in the stream itself or indirectly from reduced soil infiltration as 
urban development or deforestation increases. 
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9.2. Predicting Floods 


We can’t make predictions about when a river will flood. Indeed, we consider 
floods to be random events. We can however, calculate a probability of how often a 
river will flood based on what it has done in the past. If we know for example that 
random tosses of a coin should ultimately result in having the coin tosses turning up 
half heads and half tails, we can examine a river from the same perspective. If we have 
enough data over a long period of time, we can say that the most extreme flood event 
over a hundred years for example, becomes the 1 in 100 year flood event. This is 
expressed as the probability 0.01. And this probability is what is now used by most 
insurance companies, both private and government operated, to assess whether or not 
someone will be given flood insurance and what their rate will be. 


Of course this is a probability with a recurrence interval of 100, with the chance 
of it occurring once in that interval. And it’s data that tells us what we need to know. 


If we examine this scenario from a different perspective, and we want to know 
what is the probability of having a flood above a certain height or volume, we can look 
at the data. Let’s say we find that five floods over 100 cm have occurred in a time span 
of 50 years. We then have: 5 floods in 50 years. That becomes 1 flood in 10 years. As a 
probability we can say 0.10 or express it as a 1 in 10 year flood. 


But what if we don’t have enough data over 100 years or for the particular set of 
time we are interested in, whether its 10 years, 50 years or 500 years? We must 
extrapolate our data. We have a pattern of floods over time. Some floods are small and 
numerous. Some floods are large and infrequent. These data will point towards a rising 
extreme over time. 


But all is not so easy. Even if we have a low probability of 0.01 for example, we 
can still have flood events that not only exceed expected water levels but also exceed 
our recurrence interval. So as with our coin tosses, it may happen that we turn over four 
heads in a row before getting a tail, floods can happen in the same manner. So while it’s 
unlikely, it has indeed happened that two or more major floods have happened within 
less than a 100 years and even back to back. In fact, floods are not as random as we 
would like them to be for our purpose of probability calculations. Many factors affect 
the actual occurrence of floods and our anticipation of their actions will become more 
accurate over time as data increases, as we take into consideration the effects of land use 
changes, climatic fluctuations, and changes in the river drainage basin area. 


Sometimes floods are associated with extreme weather events such as hurricanes 
and these events can cause flash floods. Hurricanes are common along many coastlines 
and they have the capacity to drop large quantities of water onto already saturated 
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land; further exasperating flood conditions. Hurricanes form when several 
thunderstorms rotate in an area of low pressure. These hurricanes take energy from the 
warm ocean water; as long as a hurricane is over warm water it grows, but as it moves 
away from the tropical waters it dies. We can use our same probability calculations to 
determine the likelihood of a hurricane event. 


Miami Hurricane: During a hurricane season in the Gulf of Mexico there is a 10% 
chance that a category 3 or higher hurricane will hit Miami in any given year. The 
higher the category of the hurricane (ranging from 1-5), the more destructive power it 
has. Assuming these chances are independent from one year to the next, what are the 
chances a category 3 or higher hurricane will hit Miami in the span of 15 years? 


Solution: The chance of the hurricane (category 3 or higher) not hitting in any year is 
90%. Therefore, the chances of the hurricane not hitting in the next 15 years (assuming 
independence) is given by: 


(0.9) 15 = 0.2059 


which translates to about 20.59%. Therefore, the chances the hurricane will hit during 
the next 15 years is given by 


1 - (0.9)!5 = 0.7941, 


that translates to 79.41%. That’s a lot to be stressed about! 


New Orleans Hurricane: During a hurricane season in the Gulf of Mexico there is a 12% 
chance that a category 3 or higher hurricane will hit New Orleans in any given year. 
Assuming these chances are independent from a year to a year, what are the chances the 
hurricane (category 3 or higher) will hit New Orleans in the span of 8 years? 


9.3. What Can We Do About It? 


We have two problems that are not technological but conceptual. Complacency 
due to landscape amnesia is often our own worst enemy. Landscape amnesia is the idea 
that people who live in a certain place often don’t notice the subtle yet constant changes 
that occur on the landscape around them. A small landslide, a few new buildings each 
year, a small forest cleared away for farmland, or some erosion of a nearby stream often 
go relatively unnoticed. After many years, the landscape seems to be the same as it has 
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always been. Yet a person, who left that place and comes back decades later, will readily 
notice the often extreme differences accumulated over time. We all are afflicted with 
this. 


And so we often miss the small changes that alter our landscape slowly and 
incrementally. We even forget about large events such as floods that may have 
happened beyond our own lifetimes. Slowly and incrementally we build up farms and 
low-lying buildings on floodplains, we build dykes and levees that lead to increased 
sediment build up in the bottom of rivers, decreasing stream capacity and leading 
eventually to another devastating flood. Entire cities are built on floodplains with little 
adaptation until a major event occurs to reawaken us. The river itself is constantly 
readjusting to a state of equilibrium. It becomes imperative that long term data be 
gathered and maintained and then used in a regulatory framework of rules and laws 
that can span the time of landscape amnesia so that we can still live on floodplains, but 
in recognition of the river’s dynamic nature. 


But our second conceptual problem is one of a reinforcing feedback loop of fear 
and control. As floodplain use increases, attempts are made to increase control over the 
river. We know today that flood control measures often result in a decrease of small but 
regular floods, while increasing the severity of large-scale and devastating floods. Each 
physical control measure, whether it is dams, dykes or levees, results in various 
problems that seem to push us closer each year to a major catastrophe. The more we 
attempt control the more we fail on a large scale. We have yet to find a viable solution. 
Instead we face the continuous prospect of both natural and man-induced floods each 
year that bring wealth to some and worry to others. 


Where we place our homes and businesses comes down to reliability of data, the 
probability of an event occurring, uncertainty, and severity. But floods are usually the 
beginning of the disaster, for water beyond its banks, becomes one of the greatest 
sources of disease outbreaks, especially in places that are unprepared. 
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Credit: J. Haney Carr, Centers for Disease Control, National Institutes of Health 


Prevalence and Spread of 
Infectious Diseases 


The image above shows a microscopic organism that was collected from a wild 
stream. Itis common to find many different organisms in untreated waters, and 
although many are harmless, others may cause serious health concerns. Before we can 
learn about disease outbreaks, we need to understand the world we live in, how 
diseases originate, and how to prevent or treat disease. 
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Chapter 10. We Live in a Microbial World! 


Single-celled, microscopic organisms have dominated the planet ever since life 
first appeared almost 3.5 billion years ago. The simplest of these organisms are 
prokaryotes, organisms such as 
bacteria that lack a nucleus. Bacteria 
exist everywhere: in the soil, in the 
ocean, in freshwater, in the 
atmosphere, and in us. So numerous 
are bacteria on and within a human 
body that bacterial cells outnumber 
our own cells by 10X (See Textbox: 
Scientific Notation). The vast 
majority of bacteria are beneficial; 
Earth’s ecosystems could not 
function without the essential 
services these microbes provide in 
nutrient recycling and nitrogen 
fixation. 


10.1. Fossilized Microbes 


The oldest known fossils are those of stromatolites, mats of microscopic 
organisms similar to modern photosynthetic cyanobacteria. Scientists estimate these 
fossils to be 3.5 billion years old, the first known living organisms on Earth. How is the 
age of these fossils determined? We use exponential growth and decay! 


Geologists measure the ratio of radioactive isotopes present in fossil-bearing 
rocks and use the rate of radioactive decay to estimate the age of these rocks. (See 
Chapter 3). Remember that to exhibit exponential growth, the quantity has to grow as a 
percentage of the quantity itself present at any given time; if the growth were by a fixed 
amount, we would have linear growth! Radioactive decay is a game of chance. A nucleus 
is disintegrating with a certain probability and the rest is a law of large numbers. 
Higher the probability of disintegration shorter the half life and vice versa. No matter 
what the case is: the substance loses a percentage of its mass continuously. 
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Aging Fossils: A fossil is found in a rock containing 0.22 g of Uranium-235 and 1.12 g of 
Lead-207. Uranium-235 decays to Lead-207 with a half-life of 713 million years. Use 
the elapsed time of its’ half life to calculate the age of the fossil-bearing rock. 


Solution: 


The mass lost from nuclear decay is negligible when dealing with these quantities of 
rock so we don’t need to factor the mass defect into our equation. Thus, the total mass 
of the uranium and lead in the rock is given by: 

Total mass: 0.22 g + 1.12 g = 1.34 g 

Ratio of Uranium-235 (parent element) to the total: 

0.22/1.34 = 0.16 ratio, or 16% of the rock is Uranium-235. 


To calculate the half life, we use this ratio in our formula for exponential decay, where T 
= the half life of Uranium-235, which is 713 million years, and t is the time since 
fossilization occurred. 


y _ logQ/Q, 
T log0.5 


"3 
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log0.5 


—0.3010 


t=1858 million years 
t =1.86 billion years 


Thus, the fossil is estimated to be about 1.86 billion years old. 


Bacteria that grow on or in your body are known as your “normal flora” or 
“indigenous microbiota”, and they also provide many essential services. These bacteria 
help you digest your food, manufacture Vitamins K and B13 eliminate toxins and, most 
importantly, protect you from pathogenic (disease-causing) bacteria (See Textbox: Do I 
need a colonic cleanse?). Experiments have found that laboratory animals raised in 
sterile conditions without normal flora bacteria are much more susceptible to infection 
from pathogens such as Salmonella. For Salmonella to cause disease symptoms, an 
ingested dose of 10° Salmonella cells is needed in an animal with a healthy normal flora; 
for a sterile animal it takes a dose of only 10 cells to induce Salmonella symptoms (Fix, 
2011). 
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Do I need a colonic cleanse? 


Prior to a colonoscopy to screen for colon cancer it is necessary to clean the solid matter 
from the colon (large intestine). Typically this is done under the direction of a gastroenterologist 
using prescribed laxatives, enemas, or rinsing the bowel with a sterile, electrolyte-balanced solution. 
Screening colonoscopies have greatly reduced the mortality rate of colon cancer and are now a 
routine procedure performed on people 50+ years of age (American Cancer Society, 2011). 


However, colonic cleansing and irrigation have become popular alternative therapies among 
people under the age of 50. This practice is based on the now falsified hypothesis of “auto- 
intoxication”, which suggests that decomposing food within the digestive tract produces toxins 
responsible for symptoms such as headaches and fatigue. The concept of autointoxication can be 
traced to Ancient Egypt (Chen and Chen, 1989), long before the discovery of normal flora gut 
bacteria. Not only is there no scientific evidence to support the claims of alternative colonic 
cleansing (Chen and Chen, 1989; American Cancer Society, 2008), but the practice can be 
dangerous (Barrett, 2010). In some cases, alternative practitioners irrigate the colon with coffee 
and/or coffee grounds, which has been associated with electrolyte imbalances, dehydration, and 
heart attacks. There is also evidence that frequent colonic cleanses can disrupt your normal flora 
bacteria, and without these beneficial bacteria you will be more vulnerable to pathogens. Better to 
get your dose of coffee at the Tim Horton’s than through this pseudoscientific practice! 


10.2. Prokaryotes and Eukaryotes 


Prokaryotic organisms are not the only microbes with a long history on planet 
Earth: fossils of eukaryotes as old as 2.2 billion years have been identified. Eukaryotic 
organisms have their genetic material contained within a nucleus and include plants, 
animals, fungi, and simple eukaryotes collectively known as “protists” (Figure 30). 
Microbial protists and fungi (e.g., the “yeasts”) constitute a small proportion of your 
indigenous microbiota. Ruminant animals, such as cattle, sheep, and deer, house 
protists in their digestive tracts that break down the tough cellulose found in the grasses 
and other plants they feed upon. A ruminant animal that loses its microbial flora can 
die of starvation within days. 
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Figure 30. Eukaryotic cells are more complex than are prokaryotic cells, and have their genetic material 
contained within a nucleus. The genetic material of prokaryotic cells is not separated from the rest of the cells 
contents. Credit: National Center for Biotechnology Information, National Institutes of Health. 


Prokaryotes and eukaryotes possess all the criteria of living organisms (See 
Textbox: How do we know when something is alive?). Viruses, on the other hand, are 
not celled-based and therefore many biologists do not consider them to be living 
organisms. Nonetheless, viruses do possess several of the characteristics of life. They 
have the ability to reproduce, they have genetic material (DNA or RNA), and they most 
certainly evolve, often very rapidly and to the detriment of humans. Viruses span the 
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Figure 31. HIV is an RNA-based virus (retro virus) that 
has both a protective protein capsid surrounding the RNA 
strands and a lipid layer (membrane) surrounding the 
capsid. Credit: US National Institutes of Health; modified 
by C. Henderson. 


gap between living and non-living 
entities. So what is a virus? A virus is 
simply one or more strands of DNA 
or RNA surrounded by a protective 
protein coat (Figure 31). Some of the 
more complex viruses may also have 
a layer of lipids (fats) around the 
protein for extra protection. The only 
way a virus can produce is to “hijack” 
a living cell: it inserts its genetic 
material into the cell and uses the 
cell’s ribosomes to replicate the viral 
material. Often the cell is destroyed 
once this process has been completed 
(lytic viral cycle). In other cases, the 
viral DNA is inserted into the host cell 
DNA and each time the host cell 
replicates, the virus is replicated too 
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(lysogenic viral cycle). Any type of cell can be attacked by a virus — even bacteria are 
plagued by viruses. When it is not hijacking a living cell, a virus is inert and has more 
in common with the non-living than the living. 


Even if the microbial world included just bacteria, microbes would dominant the 
planet. Add to those numbers of bacteria all the single-celled protists, yeasts, molds, 
and viruses and it is easy to conclude that we are indeed living in a microbial world! 


How Do We Know Something is Alive? 


The most common elements in a living organism are carbon (C), hydrogen (H). Oxygen (O), 
and nitrogen (N); but chemical composition alone cannot tell us whether or not something is 
alive. So how does a living entity differ from a non-living one? Biologists have amassed a 
list of features all living organisms share: 


Cells: are the structural and functional unit of life. 
Reproduction: Living organisms have the ability to reproduce themselves. 


Growth and Development: Living organisms grow and often change forms (develop) 
throughout their lifespan. 


DNA: Growth, development, maintenance, and reproduction are directed by information 
encoded in the DNA molecules of the organism. The DNA of all living organisms is composed 
of different combinations of the same four building blocks (nucleotides). 


Energy: Living organisms must extract and utilize energy from their environment in order to 
maintain their complexity: growth, repair, and replication all require energy 


Regulation: To sustain life organisms must regulate their internal environment within a narrow 
range of conditions, regardless of fluctuating external conditions. 


Evolution: Reproduction and changes in the DNA code allow generations of organisms to 
change over time (evolve). 
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Chapter 11. Microbes as Agents of Infectious Disease 


There are two broad categories of diseases: diseases caused by a breakdown in 
the functioning of one’s own system (such as cancers and autoimmune diseases) and 
diseases caused by a microbial or parasitic agent (such as influenza or tuberculosis). 
The latter are referred to as infectious diseases and are the focus here. 


11.1. Discovery of Microbes 


Science and Technology go hand-in-hand: 
applications of scientific discoveries lead to new 
technologies and new technologies lead to new scientific 
discoveries. The birth and advancement of 
microbiology, the study of microbial life, can be traced to 
a revolutionary technology — the microscope. 


Antoni van Leeuwenhoek (1632-1723), a Dutch 
tailor, is credited with making the world’s first 
microscopes (Figure 32). Why would a tailor want to 
make a microscope to observe microbial life? In fact, van 
Leeuwenhoek had no knowledge of microbes when he 
started grinding glass to make magnifying glasses — his 
initial goal simply was to use a magnifying glass to 
examine the quality of cloth. However, van 
Leeuwenhoek started to look at other things under 
magnification and was amazed by what he found. 
Observing the microbial world became van 
Leeuwenhoek’s passion. Throughout his life he refined and improved his simple 
microscopes and made meticulous observations of what he saw: single-celled algae, 
protozoans, yeasts (fungi), insect body parts, sperm cells, and red blood cells. In 1676, 
he discovered organisms smaller than anything else he previously had observed; van 
Leeuwenhoek had discovered bacteria and his discovery would change the way we 
viewed the world. 


Figure 32. A reproduction of van 
Leeuwenhoek’s microscope. 


It was not until the late 19th century that scientists began to make the connection 
between microbes and disease. Prior to this time various diseases were attributed to 
evil spirits, bad blood, and even personality traits. Patients suffering from tuberculosis 
(TB), for example, were believed to be more sensitive and spiritual than were healthy 


123 


people. This belief led to the unusual 18 century fashion of otherwise healthy young 
women trying to appear sickly by purposely paling their skin. 


Robert Koch (1843-1910) pioneered the 
laboratory techniques for isolating different 
types of bacteria. In 1877 he successfully 
identified the bacterium that causes anthrax by 
examining blood samples from infected 
animals. In every blood sample, he found the 
same rod-shaped bacteria. He then grew 
colonies of this bacterium outside of an animal 
body. When he injected a healthy animal with 
the isolated colonies, the animal developed 
anthrax. Blood samples from the experimental 
animal showed large numbers of the bacterium, 
supporting the hypothesis that this was the 
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Figure 33. A microscope image of a 
colony of the bacteria responsible for 
causing tuberculosis. Credit: J. Haney 
Carr, Center for Disease Control. 


agent responsible for anthrax. Following his success with anthrax, Koch used similar 
techniques to identify the bacteria responsible for tuberculosis (1882; Figure 33) and 
cholera (1883), respectively (See Textbox: Koch Postulates). 


Koch’s Postulates 


In 1905 Robert Koch was awarded the Nobel Prize in Physiology/Medicine for his 

pioneering work in isolating and identifying the bacterium that causes tuberculosis, 
Mycobacterium tuberculosis. Koch is equally well known for articulating the steps 

necessary in identifying agents of disease. These steps are called Koch’s 


Postulates: 


1) For a particular microbial agent to be responsible for a disease, it must be found in 
every diseased individual examined and be absent from healthy individuals. 


2) The microbial agent must be isolated and grown outside of a patient's body. Today, 
microbes typically are grown on agar petri plates; in Koch’s early experiments, he used 


slices of potatoes as the growth medium. 


3) When the microbial agent is introduced to a healthy animal, the animal will develop 


the disease. 


4) The same microbial agent can then be re-isolated from the experimental animal. 


Koch isolated the microbes present in diseased patients by smearing samples 
taken from the patients onto a nutritive flat surface, such as a slice of potato. He then 
waited for distinct visible clusters, called colonies, to form. These colonies differ in 
physical appearance depending on the type of microbe they contain. Each colony is 
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descended from a single cell that was smeared on the nutritive surface. In the presence 
of these abundant nutrients, the single cell divides into two new cells, which in turn 
each divide into two more new cells, and so on. Division can occur as frequently as 
every 20 minutes under optimal conditions. 


Bacterial Growth: You smear a sample known to contain microbes onto a nutritive 
surface at 7:00 pm Tuesday evening. One of the bacterial cells on the surface divides 
every 20 minutes. You return to the laboratory at 7:00 am Wednesday morning and find 
the bacteria have divided enough to form a visible colony. How many cells are in the 
colony on Wednesday morning? 


Solution: Similar to our previous calculations of exponential decay, these bacteria are 
growing exponentially. Thus, we will use the calculation for exponential growth: 


Q: = Qo 2t/T 


We started with just one cell so Qo = 1. The bacteria can divide every 20 minutes, so T = 
20 minutes. You return after 12 hours = 720 minutes (Remember we need to keep units 
consistent! ). 


Q = Qu2"/7 
= (1)2720/20 
= 6.87 X 10 or 
= 68,700,000,000 bacterial cells. 


11.2. Pathogenic Bacteria 


As noted previously, the vast majority of bacteria are beneficial to humans and 
the Earth’s ecosystems. Only a few bacteria cause disease, but these pathogenic bacteria 
do a lot of damage: pathogenic bacteria are thought to cause approximately 50% of all 
human diseases. 


More than 1000 bacterial species have been identified as part of the human 
normal flora. Staphylococcus aureus is among the more common bacteria and is found on 
our skin, mucous membranes, mouths, and gastrointestinal tract. Thus, it might be 
surprising to learn to that S. aureus can cause a wide range of diseases including toxic 
shock syndrome, childbed fever, some forms of pneumonia, and necrotizing fasciitis 
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(a.k.a., ‘flesh-eating’ disease). How can such a potentially dangerous pathogen be living 
all over our bodies with no apparent affect? The answer is “location, location, location”. 
When our normal flora bacteria are located where they are meant to be, they are 
beneficial to us; however, when those same bacteria are introduced elsewhere, the effect 
can be devastating. For example, bacteria living harmlessly on the surface of our skin 
can cause septicaemia if introduced to the bloodstream. The term for such an infection 
is ‘ectopic’, which means ‘out of place’. In addition to ectopic infections, some species of 
bacteria are always pathogenic. One of the most notorious is Vibrio cholera, the 
bacterium that causes cholera (See Textbox: John Snow and Cholera). 


John Snow and Cholera 


Nineteenth century England was plagued by cholera epidemics. Cholera is thought to have 
originated in south Asia, particularly in the river deltas of India and the area that is now 
Bangladesh. Thus, the frequent merchant ships of the East India Trading Company returned to 
England with much more than just tea! John Snow, a physician practicing in London during this 
time, witnessed several cholera outbreaks. Applying logical thinking to his observations of 
cholera epidemics, he questioned the prevailing belief that cholera was caused by “miasma” or 
“bad (smelly) air’. 


Let’s follow his logic: 
Air can cause diseases. 
(Premise assumed to be true absent any reason not to think so) 


People who do not get cholera tend to live in places with fresh air. 
(A set of observations) 


People who breathe foul air soon get cholera 
(Another set of observations) 


Therefore, cholera is caused by foul air. 
(A valid conclusion to the argument. A true conclusion if the premise is true) 


To continue with this example — new observations clearly demonstrated the premise above is not 
true, therefore the conclusion above, while valid, must be acknowledged to not be true. Hence we 
need a new premise and a new argument, in order to draw a new conclusion — we are looking for the 
truth about cholera. 


Perhaps the new observations were: people who have never been exposed to foul air are also 
seen to be sick with cholera; some people who breathe foul air are not sick with cholera; people sick 
with cholera have all consumed water from one particular well. New premise: something other than 
foul air causes cholera — perhaps water from one well causes cholera. 


And so the theory of what causes cholera was found to be unsound in the face of new evidence. 
A new theory was proposed (cholera is caused by water from one particular well) and can be tested 
both deliberately and methodically, or can be put to the test by new randomly gained pieces of 
evidence. 


Snow followed this process and concluded that cholera instead must be a waterborne disease. 
His work resulted in improved water sanitation throughout England, and a halt to the cholera 
epidemics in that country. This is how science works; this is a necessary part of the scientific process, 
the scientific method. 
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V. cholera typically is a marine species of bacteria but also has the ability to 


survive well in freshwater. Shellfish including oysters, mussels, and shrimp are thought 


to be the natural reservoir of this bacterium. Cholera gets into the human population 
when people eat contaminated raw shellfish or when the bacterium makes its way into 
drinking water supplies. From that point, the highly contagious cholera is passed on 


from one person to the next. 


11.3. Parasitic Protists 


“Protist” is a term applied to those Eukaryotic organisms that cannot be readily 
classified as plants, animals, or fungi. These protists can be single-celled or multi-celled 
and make their living through photosynthesis, parasitism, or consumption of other 
organisms. Single-celled parasitic protists live within another organism’s body and, in 


doing so, cause disease. 


Giardia (a member of the Excavata) and 
Cryptosporidium (a member of the Ampicomplexa) are 
two of the more common parasitic protists that 
Canadians may encounter (See Textboxes: 
Cryptospordiosis Comes to Saskatchewan and The 
1980's and Giardia, respectively). The diseases caused by 
these organisms are called Giardiasis (also known as 
“Beaver Fever”) and Cryptosporidiosis, respectively. 
Both parasites are acquired by drinking contaminated 
water and both cause severe abdominal cramping, 
diarrhea and/or vomiting, and headaches. Patients who 
are under the age of two or who are already sick, are in 
danger of potentially fatal dehydration. Other 
mammals, such as cattle, dogs, and beavers can carry 
Giardia and Cryptosporidium, and it is thought that water 
contamination is often via these animals (Figure 34). The 
best way to prevent these diseases is adequate water 
treatment of municipal drinking water supplies. 


Figure 34. Although Giardia is 
sometimes called ‘Beaver Fever’ 
many mammals can spread this 
parasite including dogs. This is one 
of the reasons why you are not 
allowed to let your dog swim in the 
Glenmore Reservoir in Calgary. The 
reservoir provides drinking water to 
500,000 residents. 
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11.4. Disease-causing Fungi 


As in the case of bacteria, the majority of fungi are beneficial. More than 80% of 
all plant species enhance their nutrient uptake via a symbiotic relationship with fungi 
that live in the soil (Russell et al., 2010). These relationships, called mycorrhizal 
associations, are so common that some biologists hypothesize that plants and fungi 
must have colonized land masses simultaneously. Many plants, in fact, cannot live 
without their fungal symbionts. Fungi are also our primary source of antibiotics and 
one group of fungi, the yeasts, are used in making breads and beer. Nonetheless, some 
fungi can be pathogenic. 


A fungal infection is referred to as a mycosis. The vast majority (approximately 
90%) of these infections develop in the top layer of the skin. “Athlete’s foot”, 
“ringworm”, and “jock itch” are some of the common names for one of the most 
common types of skin mycosis: Tinea (Mayo Clinic, 2011). Although unsightly and 
uncomfortable (a skin mycosis is typically itchy), Tinea infections are usually considered 
non-life threatening. Another fungus, the opportunistic yeast, Candida albicans, resides 
as part of the human normal flora in the mouth, intestinal tract, and vagina. Typically, 
the Candida population is kept under control through competition from normal flora 
bacteria and causes no harm to its human host. However, if the bacterial population 
declines (e.g., as a result of antibiotic use) or if the person is immune-compromised (e.g., 
has had an organ transplant, has HIV-AIDS, or has recently undergone chemotherapy 
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for cancer) Candida quickly multiples resulting in thrush (yeast-mouth infection) or 
vaginal yeast infections (Medical Dictionary, 2011). 


Some of the most serious fungal infections are those caused as a result of 
exposure to “moulds” (See Textbox: Grow-Op Mould). “Mould” is the common name 
used to describe to a number of filamentous fungi. Inhalation of mould spores can 
cause asthma-related respiratory symptoms such as coughing and wheezing, as well as 
irritation to the mucous membranes of the eyes, nose, mouth, and throat (Health 
Canada, 2007). Aspergillus, one of the most common types of mould, is virtually 
impossible to avoid. Its spores are common in dust, soil, on plants, and in decaying 
plant matter, and most of us breathe in small doses of these spores everyday without 
issue (Centres for Disease Controls, 2010). However, inhalation of high doses of spores 
can result is the lung disease Aspergillosis (Centres for Disease Controls, 2010). The 
genus Stachybotrys includes “black mould” (S. charatum), one of several factors 
implicated in “sick building syndrome” (Medical Dictionary, 2011a). Stachybotrys 
produces mycotoxins, present on the spore surface, which may increase the chance of an 
adverse reaction if inhaled (Volk and Zitomer, 2002). 


Stachybotrys — the Grow-Op Mould 


Moulds like moist environments in which to grow. Despite our best efforts at cleanliness, the 
chances are most homes will have some mould growth, typically in the bathrooms or in damp corners 
of the basement. These common house moulds are not difficult to kill — hydrogen peroxide or bleach 
will do the trick. We may even encounter the notorious Sfachybotrys in ourown homes. Stachybotrys 
can digest the cellulose in drywall. Chipped paint or peeling wallpaper that exposes the drywall 
underneath, coupled with high moisture levels, creates the perfect environment for Stachybotrys. 
Although it has quite a nasty reputation, small areas (<1 m2) of black mould can be killed with bleach 
and the area repainted to keep it at bay. Persistent mould growth means there is a persistent moisture 


issue, perhaps arising from a leak, poor ventilation, or humidity levels that are too high. 


Stachybotrys is one of the reasons that real estate agents in many jurisdictions are required by law to 
disclose whether a home was formerly used as a marijuana grow-op. Grow-ops essentially turn homes 
into greenhouses with high temperatures and high humidity so that the tropical plants can grow. 
Generally the operators do not spend much time worrying about home repairs, such as painting over 
areas of exposed drywall or cleaning up small patches of mould growth. By the time the home is being 
sold (often because the operators have moved to the ‘Big House’), the former grow-op has so much 
mould growth that the only option is to gut it or knock it down. 
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11.5. Viruses 


Viruses are much more simple than bacteria. They are composed of a genome of 
one of more strands of DNA or RNA (See Textbox: This Virus is Retro), a protective 
protein coat, and sometimes a lipid envelope. The only way that viruses can reproduce 


is by parasitizing a living cell. The virus alights 
on the surface of the cell and its genetic content 
enters into the cytoplasm. If the virus has a lytic 
life cycle, the cell’s ribosomes, which synthesize 
proteins, are “tricked” into replicating the viral 
genes and proteins. These components are 
assembled into new viruses that “burst” out of 
the cell, destroying the host upon their exit 
(Figure 35). Other viruses don’t burst out, but 
instead “bud” off the host cell. In doing so, they 
take part of the host’s plasma membrane with 
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Figure 35. Viruses use the machinery of 
other organisms (a bacterium is pictured 
here) to complete its life cycle and produce 
multiple copies of itself. 


them. This essentially “cloaks” 
them and makes it more difficult for 
the next host cell to detect them as 
viruses. 


Viruses with a lysogenic life cycle 
reproduce in the cell’s nucleus. 
These viruses insert their DNA into 
the host DNA via recombination . 
Each time the host DNA is 
replicated, so is the virus DNA. 
This has the potential to scramble 
the genetic instructions of the host 
can and can cause some types of 
cancer. (See Textbox: Viruses that 
cause cancer). Lysogenic viruses 
remain dormant within the host 
DNA for a period time, passively 
allowing the host cell to replicate 
their genetic material for them. 
Periodically, some environmental 
cue will induce these viruses to start 
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assembling new viruses. At this point, the virus may switch to a lytic life cycle and the 
viruses are released from the cell to infect new cells. The viruses then return to a 
lysogenic life cycle. 


Viruses That Cause Cancers 


We typically don’t think of cancers when we think about infectious diseases. That's because 
most cancers are neither contagious nor caused by infectious agents. However, a small minority of 
cancers (approximately 10%) are caused or promoted by viruses and thus, are infectious. The best 
known of these oncoviruses (cancer-causing viruses) is the human papilloma virus (HPV) that causes 
approximately 70% of all cervical cancers. Other notable oncoviruses include hepatitis B and C 
(cause some liver cancers) and Epstein-Barr virus (causes some lymphomas). “Cancer” is a term 
given to more than 200 different diseases, all characterized by a malfunction in the cell cycle control 
system: the cells keep dividing when they should have stopped. How could a virus cause such a thing 
to happen? The genes encoded in the cell’s DNA include specific instructions about when to start 
dividing and when to stop. If a virus with a lysogenic cycle inserts its DNA into the host DNA at just the 
right place, these instructions are scrambled and cell keeps on dividing long after it should have 
stopped. 


Another way that viruses can cause cancer is if they require an actively dividing cell in which 
to replicate. The virus may encode its own proteins that push the cell through the cell cycle when it 
might not normally do so. Sometimes these viral proteins can go overboard, though, and by pushing 
the cell through the cell cycle uncontrolled, can lead to cancerous growth. 


Recently, viruses have been investigated as a way to CURE cancer! A Canadian research, Dr. 
Patrick Lee, has pioneered the field of "oncolytic viruses", so called because cancer cells (onco-) are 
killed (lytic) by them. Reovirus and a few other viruses that are medically benign, are being 
investigated for their ability to kill cancer cells, based on the expression of particular proteins in cancer 
cells. For example, some cancers express an activated form of the protein called Ras at very high 
levels, while normal cells do not express the activated form. Reoviruses are able to infect and kill cells 
that express the activated Ras protein, thereby targeting just the cancer cells and not the healthy, 
normal cells. Oncolytic viruses hold promise for enhancing our arsenal against cancer. 


Cold sores are caused by a virus with a lysogenic life cycle. The virus 
responsible is the herpes simplex virus type 1 or HSV-1 (Mayo Clinic, 2011a). Most of 
the time, this virus remains dormant within the cells of the nervous system. However, 
physiological stress to the host, such as another virus (e.g., the common cold) or skin 
damage (e.g., sunburn), can activate the HSV-1 to assemble new viruses and the result is 
annoying and sometimes painful blisters around the mouth and lips (“cold sores”). 
Infection by HSV-1 usually occurs in childhood and once you harbour this lysogenic 
virus you have it for life. 


Lysogenic viral cycles have also been suggested as a factor in several 
autoimmune diseases including multiple sclerosis, rheumatoid arthritis, and Type I 
(child onset) diabetes. Patients often recall suffering from a virus (usually dismissed as 
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influenza) just prior to the onset of their autoimmune disease. Thus, it has been 
hypothesized the patient’s immune system may attack its own cells because it 
recognizes the viral DNA embedded among the host DNA. If this is the case, then the 
possibility exists that there may one day be vaccinations available to prevent these 
debilitating diseases. 
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Chapter 12. Prevention and Treatment of Infectious 
Disease 


12.1. Antibiotics: “Miracle Drugs” 


Robert Wadlow (1918-1940) holds the record as the tallest person to have ever 
lived (Guinness World Records, 2011). A week before his death he measured an 
astonishing 2.72 m (8’ 11.1”), and his doctors believed he was still growing at that time. 
Robert Wadlow died in 1940 at the age of 22. What was the cause of his death? A blister 
on his ankle became infected (Guinness World Records, 2011). Antibiotics were 
introduced to the marketplace during the 1940’s not long after Mr. Wadlow’s death. 
Had they been available when his ankle became infected, there is a good chance Robert 
Wadlow would have survived the infection. 


Antibiotics are effective against only one type of infectious agent — bacteria. 
However, because bacteria are responsible for approximately 50% of all human 
diseases, it is not surprising that antibiotics were soon dubbed the “miracle drugs”. 
Now the medical community had a weapon against some of humanity’s worst killers 
including tuberculosis, bubonic (black) plague, and tetanus. Infected blisters were no 
longer a matter of life and death. 


Bacterial cells and human cells are very different in structure and biochemistry. 
Antibiotics target features of bacteria that our own cells lack. For example, bacterial cells 
have a cell wall surrounding the plasma membrane whereas human cells just have a 
plasma membrane. Penicillins work by inhibiting peptidoglycan, a common component 
of bacterial cell walls. Thus, taking Penicillin will destroy the bacteria in your body but 
will leave your own cells unharmed. In addition to the large number of diseases that 
can be treated with antibiotics, another advantage of these drugs is their relatively low 
cost of production, which makes them available to a greater number of patients around 
the world. 


Have you ever wondered why doctors always stress that you must take the 
entire dose of antibiotics and that you should take the drugs at the same time every 
day? One reason (among others) is that for some antibiotics, your kidneys will break 
down the antibiotics in your bloodstream. The antibiotic Gentamicin© is used to treat 
infections caused by E. coli and Staphylococcus bacteria. This antibiotic is cleared from 
the blood at a rate proportional to its concentration. The kidneys are removing a 
fraction of the Gentamicin in the blood at all times. As the level of the drug diminishes 
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there is less Gentamicin presented to the kidneys and therefore less is excreted. As a 
result the concentration of Gentamicin in the blood is declining exponentially. 


Antibiotic Concentration: Suppose you are administered Gentamicin and the resulting 
concentration of the drug in your blood is 7.5 mg/L. In two hours, the drug 
concentration in your blood is down to 4 mg/L. Determine the half-life of the drug in 
the body. 


Solution: 
The model for the drug concentration in the body is given by 
Q = Qo: (0.5)!/7 
with an unknown half-life T (in hours). 
Now we solve for T. We know 
4=7.5- (0.5)2/T 
4/7.5 = (0.5)2/T 
0.5333 = (0.5)2/T 


Using the rule of logs, we can lower the exponents so the equation can be 
rewritten as: 


2/T = logo.5(0.5333) 
2 = T logos(0.5333) 
2 


Tan — =- — 
log, s (0.5333) 
2 
~Tog(0.5333) 
log(0.5) 
_ 2log(0.5) 
~ log(0.5333) 
= 2.2051 hours 


which is 2 hours, 12 minutes and 22 seconds. 


It could be very useful to know the half life of Gentamicin concentration in your 
blood. Suppose you are taking injections of this antibiotic four times a day. The first 
injection results in Gentamicin blood concentration of 7.5 mg/L. The next injection will 
obviously result in a slightly higher concentration due to some Gentamicin remaining in 
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your blood from the first shot. Since you take the shot every 6 hours and we have 
found the half life of 

Gentamicin to be T = 2.2 hours (we use one decimal precision), the remaining 
Gentamicin concentration amount from the first shot can be determined by: 


Qo (0.5)6/T = 7.5 (0.5)6/22 = 1.1 mg/L. 


So when you take the second shot, it will bring the Gentamicin concentration up 
to 1.1mg/L+7.5mg/L=8.6 mg/L. After another 6 hours elapse, we will have the 
following Gentamicin concentration left from the first two shots: 

Q = Qo (0.5)6/T 

Q = 8.6 mg/L (0.5) 6/22 = 1.3 mg/L 
and after the third shot you will have 1.3 mg/L + 7.5 mg/L = 8.8 mg/L of Gentamicin in 
your blood. 


We seem to be getting higher and higher Gentamicin concentrations with each 
new shot but the increases seem to dissipate. The increase of Gentamicin in your blood 
during the next shot would not even register at the first decimal place. If you keep 
taking the drug for a long time it can be shown that the highest Gentamicin 
concentration in your blood would never exceed 8.84 mg/L, no matter for how long 
you take the drug. 


12.2. Antiviral Medications 


Treating viral infections is more complicated than treating bacterial infections. 
This is because viruses are not cell-based and replicate themselves within a host cell. 
Antiviral medications target specific viral genes or proteins with the aim to disrupt the 
virus life cycle. For example, two antiviral medications used to treat severe cases of 
influenza, Relenza© and Tamiflu©, prevent the virus from gaining entry into the next 
cell. Both of these antivirals reduce the duration of influenza by about 1-1.5 days and 
can prevent development of flu symptoms in those who have had contact with an 
influenza patient (Mayo Clinic, 2011b & 2011c). Antiviral medications tend to be costly 
and have several side effects associated with them. Thus, they are usually reserved for 
only the most serious of viral infections. 
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12.3. Hand-washing 


During the H1N1 influenza outbreak of 2009, public washrooms across Canada 
were donned with posters describing proper hand-washing techniques to prevent the 
spread of the virus. Preschool and kindergarten children (who could not yet read the 
posters) were instructed to wash their hands with soap while singing the alphabet song 
twice: this would insure that their hands were thoroughly clean. The idea that our 
hands can be the vectors that spread disease is traced to a 19th century physician, Ignaz 
Semmelweis (1818-1865; Hill 2006). 


Semmelweis was teaching obstetrics in a hospital in Vienna when he noticed 
something curious: women whose babies were delivered by medical students at the 
hospital were much more likely to die of childbed fever than women whose babies were 
delivered by midwives at home. In fact, the mortality rate of women attended by 
medical students was 20 times greater than the other women! What were the medical 
students and midwives doing differently? The midwives only attended women 
labouring and delivering. The medical students were learning all aspects of medical 
practice and often attended women directly after handling cadavers during autopsies. 
Although Semmelweis had no knowledge of disease-causing microbes, he hypothesized 
that the medical students must be transferring deadly “cadaver particles” to the 
women. He started requiring that the medical students wash their hands in chlorinated 
lime water before attending to the women. The mortality rate of women attended by 
the medical students declined from 18.3% to 1.3% (Figure 36). 
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Figure 36. Monthly mortality rates from puerperal fever for women giving 
birth at the first clinic at the Vienna General Hospital 1841-1849, as reported 
by Semmelweis. He devised and instituted a chlorine handwashing policy in 
mid-May 1847. 
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Childbed fever, also known as puerperal fever, is caused by Streptococcus, a genus 
of bacteria that includes some of the most numerous and common bacteria living on our 
skin. It is an example of an ectopic infection: the bacteria do no harm living on our skin 
but can cause septicaemia of the genital or urinary tracts. Childbed fever is now rare 
and women who do contract this type of infection following childbirth can be treated 
with antibiotics. Today, the greatest single risk of developing childbed fever is delivery 
via a Caesarean section. Meanwhile, hand washing is still a cornerstone in health care 
and disease prevention. Hospitals and medical clinics across Canada are equipped with 
alcohol hand-sanitizer stations to reduce the chance of infection being inadvertently 
transferred from one patient to the next, and the alphabet song rings out from preschool 
washrooms across the country. 


12.4. Vaccinations 


No other medical advance has saved more lives than vaccinations. The idea 
behind vaccinations is both simple and elegant: use the bodies’ own immune system to 
fight deadly diseases. 


Our immune systems are truly amazing! When a microbe invades our bodies, 
the immune system identifies that it has encountered “foreign proteins” and launches 
an offensive against the invader. Many of the symptoms associated with disease are 
actually part of the immune response. For example, an elevated body temperature 
(a.k.a., a fever) is an immune system response that helps to denature the proteins of the 
foreign microbe and mobilize the various types of white blood cells involved in 
defending the body against the invader. For this reason, many physicians suggest 
fever-reducing medications only in the case of high-grade fevers (above 40° C). Innate 
immunity, such as fevers, are nonspecific defences that are always available to fight a 
wide variety of infectious disease. 


Our immune systems also have the ability to learn to recognize specific microbes. 
This is referred to as acquired immunity. Acquired immunity is based on past 
experience — the immune system needs to have previously encountered the microbe to 
be able to defend against it. On surface of the microbe are molecules called antigens 
that elicit the immune system to produce antibodies. Antibodies are proteins that attach 
to the antigens and immobilize the microbe. Upon first encounter with a specific 
microbe, it typically takes the immune system 10-14 days to produce these antibodies. 
However, when that microbe is encountered a second time, the immune response is so 
fast the person typically experiences no symptoms whatsoever. 
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Unfortunately, some infectious diseases are so deadly that the immune system 
does not get a second chance: the first encounter with the microbe kills the patient. This 
is where vaccinations can make a difference. A vaccine is a dead or disabled microbe 
that is introduced into the body. It can do no harm to the patient, but it still has the 
surface antigens of its species. Over the next 10-14 days, the immune system produces 
antibodies in response to these antigens. When a live version of the microbe is later 
encountered, the antibodies are ready and the vaccinated individual does not become 
ill. Childhood mortality rates have dropped tremendously as a result of vaccination 
programs. 


Some people still erroneously question the need for vaccinations and now it is 
common to hear individuals debating their use. In many cases, the debate centers on the 
link to autism. There is no scientific evidence to support a link between autism and 
vaccination but this misinformation is still perpetuated. But where did this idea come 
from? 


12.4.a. The Autism-Vaccine Debate 


In 1998, gastroenterologist Andrew Wakefield and several colleagues published 
an astonishing study in the medical journal the Lancet. A study involving 12 autistic 
children with gastrointestinal problems suggested both the autism and the digestive 
problems arose shortly after the children had received the MMR (measles, mumps, and 
rubella) vaccination (Wakefield et al., 1998). Science requires that results can be 
replicated by an independent group of researchers before any conclusions are drawn. 
However, in this case, Andrew Wakefield presented his findings at a press conference 
before his results could be confirmed and the general public panicked. Childhood 
vaccination rates plummeted in the U.K., United States, and Canada and outbreaks of 
measles and other preventable diseases increased. 


Had the public waited before acting, they may not have been so hasty to accept 
Andrew Wakefield’s claims. Study after study failed to confirm any link between 
autism and vaccinations. One of the most thorough investigations by Danish 
epidemiologists included a sample size of 537,303 children (Madsen et al., 2002). Once 
again, no connection between vaccinations and autism could be found. Despite 
evidence to the contrary, some parents remained convinced that vaccinations caused 
more harm than good. 


Meanwhile, a British investigative reporter was becoming increasingly curious 
about the original Wakefield study. Brian Deer’s specialty was journalism on medical 
and drug industry topics. He had previously uncovered fraudulent drug industry 
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claims regarding the safety of certain pharmaceuticals. A study questioning the safety 
of vaccinations was right up Deer’s alley. This time, however, Deer did not find any 
evidence of a drug industry covered up; instead he found that Dr. Wakefield had 
misreported much of the data in the original paper. For example, some of the children 
were diagnosed with developmental issues before receiving the MMR vaccine, not after, 
and some of the children did not even have autism (Deer, 2011). Moreover, Andrew 
Wakefield was being paid by lawyer Richard Barr to attack the safety of the MMR 
vaccination so that Barr could launch a class-action suit against the manufacturer of the 
vaccine. By 1996, Wakefield was on Barr’s payroll, and Barr was using money from the 
publicly funded U.K. legal aid fund to pay him. In June 1997, Wakefield filed for a 
patent on a new measles vaccination — a vaccination that would be profitable only if a 
replacement was needed for the MMR vaccination (Deer, 2011). 


By 2004, the journal in which Wakefield et al.’s (1998) paper had been published, 
the Lancet, retracted the paper stating that the study never should have been accepted 
for publication. Finally, in 2010 the British Medical Council stripped Andrew Wakefield 
of his license to practice medicine. 


12.4.b. Vaccine Compliance and Herd Immunity 


Not all people can be vaccinated. Some individuals are allergic to components of 
vaccinations, others have diseases that leave them immune-compromised, and a few fail 
to develop antibodies even though they have been vaccinated. These individuals rely 
on the vaccinated people in the population to prevent disease outbreaks. When vaccine 
compliance drops below levels necessary for herd immunity, outbreaks occur and these 
vulnerable people get sick. 


During the Spring of 2011, both the U.K. and France were in the midst of a 
measles outbreak. The outbreak spread to Quebec, and by early June more than 250 
confirmed cases had been documented in that province, most cases traceable to people 
who had recently travelled in France (Picard, 2011). 


Herd immunity, a term that was first used in veterinary circles, occurs when 
most individuals in a population are immune to a disease and even those without 
immunity are safe from disease outbreak. The percentage of immune individuals 
needed to achieve herd immunity varies from disease to disease, depending on how the 
infectious disease is spread, how contagious it is, and how quickly it reproduces. The 
outbreak of measles in Europe and Canada in Spring 2011 occurred because herd 
immunity had been lost: too few people are being vaccinated against measles. 
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Viral infections can cause epidemics and mathematics can be used to analyze 
their spread. Each virus is assigned a contagious value to measure how quickly it 
spreads. Contagious value can also be called the infection rate; we’ll use p to denote this 
value. Measles is contagious with a number between 16 and 18, whereas the H1N1 
virus is estimated to be somewhere between 1.2 and 1.6. The common influenza virus 
has the contagious number somewhere between 1.5 and 3. 


These numbers can be interpreted as follows. Choose a unit of time, say one day. 
The infection rate of 1.6 means that one person can infect 1.6 new people in one day (p = 
1.6). Naturally what we mean by this is that 10 people, for example, can infect 16 new 
people over the course of one day. On the next day each person in the group comprising 
the 1.6 people, infect another 1.6 people. We now have 1.62 = 2.56 new people infected 
during the second day. Naturally, what we mean by this is that the original 10 people 
led to the infection of 25 new people on the second day (well 25.6, but you can’t have 0.6 
of a person!). 


Now assume we declare an epidemic if the new cases on each day are increasing 
over time and are not controlled. On the other hand, if the number of new cases 
decreases over time, then we say we do not have an epidemic. This can happen as 
follows. Suppose we manage to reduce the contagious rate, p, for the disease below the 
threshold value of one; thus, we have p < 1. Then the number of new cases on the first 
day is p, on the second day it is p2, on the nth day the amount of new cases is p”. For an 
infection rate (contagious value) of less than 1.0, as time progresses the number of new 
cases becomes smaller and smaller. Thus working to bring about lower contagious 
values is our goal; lower rates contain or halt epidemics. 


For example if p = 0.9 then the number of new cases on the 7t day is 0.97 = 
0.4783. On the other hand, if contagious values are greater than one, p > 1, the number 
of new cases will increase. For example, if p = 1.6 then the number of new cases on the 
7th day is 1.67 = 26.8435. All of these cases are measured as new cases stemming out of 
the original one person having the disease. Therefore it is imperative we reduce the 
contagious value for the given disease below the threshold value of one. Fortunately, this can be 
accomplished through immunization with vaccines, among other mechanisms 
(handwashing, etc.). 


Immunization vaccinates people against disease. In practice, however, not all 
people get vaccinated. Suppose for the sake of argument we assume that 40% of all 
people get the H1N1 flu shot. Suppose an infected person passes on the H1N1 virus to 
1.6 people on the first day (we always use the higher estimate of the contagious value). 
Fortunately, out of those 1.6 people only 60% develop the flu (percentage of those not 
vaccinated) and we only have 
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(1.6)(0.6) = 0.96 new cases of the flu on the first day. 


The effective contagious value has now been reduced to p = 0.96, which is less than our 
threshold of 1 (p < 1) and we avoid an epidemic — though it would be nicer to have 
smaller contagious value so that fewer people become infected. 


H1N1 Vaccine Compliance: What should be the minimal compliance rate (the minimum 
percent of people vaccinated) to avoid an H1N1 epidemic? The contagion for value for 
H1N1 is p = 1.6 


Recall that an epidemic — the uncontrolled, exponential spread of a particular 
disease in a population — occurs when the number of new cases appearing each day is 
increasing over time. To prevent an epidemic, then, the spread of the disease must be 
controlled. A person can only contract the disease - and become a new case - upon 
encountering a carrier if that person is not already immune. If the person has already 
had the disease, and it was not fatal, then they will not become re-infected because they 
will be immune. One way that the spread of a disease can be controlled is to wait until 
sufficient people have contracted it, and thus are immune to the disease. Left alone, 
the disease will eventually control itself. Obviously, this is not a desirable approach 
because people will still become ill in the meantime. 


Solution: 


A disease can be controlled and an epidemic prevented if sufficient numbers of 
the population are immunized against the disease and thus will not become infected 
upon encountering a carrier. For H1N1 with a contagion value of 1.6, this means that 
one infected person can infect 1.6 new, non-immune people per day, or 10 infected 
people can infect 16, or 100 infected people can infect 160. 


If the spread of the disease is to be controlled, then we must control the number 
of new cases. This means that the number of new cases each day must not increase, so at 
most one person can infect one new person. Having no new cases means that the 
disease is not increasing in the population - it is still spreading though. When we scale 
up, 100 infected people can at most infect 100 new people. This means that 60 of the 160 
who would normally become infected due to the presence of 100 infected people in the 
population need to be immune (vaccinated) so they do not contract the disease. The 
minimal vaccination rate therefore needs to be (60/160) X 100% = 37.5%. As long as 
37.5% of the population is vaccinated against H1N1, the growth of the disease in the 
population will be controlled. The level of disease will remain constant until and unless 
more than 37.5% of the population become immune. 


We can also calculate this same value using the following approach. 
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Solution 2: 

Here x is the fraction of people not vaccinated. If x = 1 (100% not vaccinated), 
then 1 person infects 1.6 people in one day. In order for one person to infect one person 
per day (i.e., a replacement and not a spread of the disease) then p =1, what percent are 
not vaccinated? So we need to solve 


1 
x=— 
1.6 
= 0.625 


Thus we need at least 100 (1 — 0.625) = 37.5% of all people to be vaccinated in order to 
avoid a H1N1 virus epidemic. 


Influenza Vaccine Compliance: Find the minimal compliance immunization rate needed 
to avoid an epidemic of the common influenza virus. Common influenza has a 
contagious number between 1.5 and 3. 


142 


Chapter 13. What are Emerging Infectious Diseases? 


In late March and early April of 2009, the US-based Centers for Disease Control 
and Prevention (CDC) first identified a new flu virus in patients living in the border 
cities of San Diego, CA and San Antonio, TX. Genetic analysis of the virus showed that 
it must have been circulating in the human population since at least January 2009. Most 
cases were relatively mild, although this strain of flu was accompanied by more nausea 
and diarrhea than is typical of influenza. By May the virus had infected more than 900 
people in living in Mexico and had contributed to at least 18 deaths (New Scientist, 
2009). Influenza deaths are not uncommon, but it is typically the elderly, the chronically 
ill, or infants who are the victims of flu-related mortality. This flu was claiming a very 
different type of victim: relatively young, healthy individuals. During the 2009 
outbreak, the mean and median ages of people dying in the US from this disease were 
40 and 43, respectively (CDC, 2010a). Pregnant women and their unborn children were 
especially vulnerable. Another unusual aspect of this influenza was the timing: in the 
Northern Hemisphere the peak month for seasonal influenza is February (CDC, 2011), 
but this flu peaked in November (e.g., Ishikawa, 2010; See Textbox: Why do we get the 
flu in winter?). Originally referred to as “swine flu”, this influenza strain became better 
known by its official name: “Influenza A, H1N1 subtype”, or the shortened “H1N1”. 


Why do we get the flu in winter? 


Whether you live in the northern or southern hemisphere, flu season occurs during the 
winter. For northerners, peak flu month is February and for southerners it is August. However, you 
would think that high levels of global travel would alter those dates — after all, a virus causes 
influenza, not being out in the cold — or does it? Ever since the viruses responsible for influenza and 
the common cold were discovered, the idea that one could get sick from exposure to cold weather 
was dismissed as a myth or an old wives tale. Now scientists have found that there may be some 
truth in it. 


The influenza virus spreads via airborne droplets when an infected person sneezes or 
coughs. Lowen et al. (2007) conducted experiments on how relative humidity and air temperature 
affected the persistence and transmission of these droplets. Two winter conditions, cold outdoor 
temperatures and dry indoor air, turn out to be the perfect environment for spreading influenza. 
Firstly, cold outdoor temperatures affect the flow of mucus in your respiratory tract. The function of 
this mucus is to bind to contaminants and clear them from the body. When the temperature is cold, 
the mucus becomes too viscous (thick) to perform this function and the virus remains in the body 
longer. Secondly, the virus-containing droplets spread best in dry air, when relative humidity is 
between 20-30% (Lowen et al., 2007). Typically, heated indoor air is dry. Moreover, the dry air can 
further hamper mucus production and damage the tissues lining your nose and respiratory tract. 
Thus, while cold weather does not cause influenza (the virus does that), it does make you vulnerable 
to catching diseases. 
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Where did this new strain of influenza come from? New diseases do not 
spontaneously generate; they evolve from existing diseases. Influenza viruses of the H1 
family are known to have been the dominant influenza strains in humans prior to 1889. 
Since then viruses in the H2 and H3 families have become the more prevalent strains, 
although H1 influenzas have made a few notable comebacks, including the 1918 
Spanish flu pandemic that is estimated to have killed as many as 50 million people 
(New Scientist, 2009). 


The appearance of H1N1 in 2009 is an example of an emerging infectious disease. 


An emerging infectious disease is any disease that has increased in incidence in the 
human population over the past two decades and is likely to continue to increase in 
incidence in the near future (Hill, 2006). Other recent examples of emerging infectious 
diseases include SARS, HIV/AIDS, Lyme disease, West Nile virus, and antibiotic- 
resistant tuberculosis (TB). In this section, we will examine the factors that can result in 
a disease ‘emerging’. 


12.1. Mutations and the Evolution of Microbes 


New strains of bacteria, viruses, or other microbes evolve from existing ones. 
The start of this process is via a mutation in the genetic material of the microbe. A 
mutation is any change in the nucleotide sequence of the DNA (or RNA, in the case of 
RNA-based viruses). Despite the amazing variety of living organisms on the planet, the 
genetic blueprint of life is very similar from one organism to the next. DNA is 
composed of four nucleotides - adenine (A), cytosine (C), guanine (G), and thymine (T) 
— and these nucleotides instruct the cell as to which amino acids to string together to 
make proteins. Organisms vary from one another with regards to the sequence of 
nucleotides, not the types of nucleotides. Mutations occur when one nucleotide is 
substituted for another, one or more nucleotides are deleted from the sequence, or one 
or more nucleotides are inserted into the sequence. The more often a cell divides, the 
more likely it is that a mutation will occur. Microbes typically divide very rapidly, 
which is why they mutate so frequently. For example, E. coli bacteria growing in ideal 
conditions can divide once every 20 minutes. In addition, many microbes also have 
another way of changing their DNA sequence: they can acquire new genetic material 
from another microbe via a process called horizontal gene transfer (See Textbox: Do 
Bacteria have Sex?). The combination of frequent mutations and horizontal gene 
transfer provides the fuel for rapid evolution of microbes, and the potential for new 
diseases. 
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Do Bacteria have Sex? 


Bacteria are single-celled, prokaryotic organisms that reproduce via simple cell division. 
When a bacterial cell divides the result is two new cells, each identical to the original parent cell. 
Therein lies a problem for the bacteria: generation after generation will be clones of the first 
bacterial cell, barring any mutations that occur along the way. In contrast, every offspring produced 
by sexually reproducing organisms (such as ourselves) has a unique combination of genetic 
material. Genetic variation between individuals is a huge advantage if a population or species is to 
withstand changes in the environment. Is there any way that a single-celled, bacterium can reap 
the advantages of sexual reproduction? 


Horizontal gene transfer involves the transfer of genetic material from one bacterium to 
another. One type of horizontal gene transfer, called conjugation, is the bacterial equivalent of 
sexual reproduction. One bacterium, the donor, makes a cytoplasm bridge to another bacterium, 
the recipient. The donor replicates some or all of its’ genetic material, transfers it across the 
cytoplasm bridge to the recipient, and the two cells then separate. In its next encounter, the 
recipient, with its newly acquired genetic material, will become the donor to another bacterium. 
Horizontal gene transfer is one of the ways that antibiotic resistance can spread in a population of 
bacteria. 


The result of mutations and horizontal gene transfer is that individual microbes 
within a population vary from one another, and it is this variation that sets the stage for 
evolution. Biological evolution occurs whenever there is a change in the frequency of 
one gene variant relative to another within a population. If all individuals are clones of 
one another, each will possess exactly the same genes and the gene frequencies will 
remain the same from one generation to the next (i.e., the population will not evolve). 
The most common reason for gene frequencies to change in a microbial population is 
that there has been some change in the microbes’ environment. These environmental 
changes include an increase in the use of antibiotics or other antimicrobials, humans 
coming in close contact with wild or domestic animals, and alterations to the habitats 
such as deforestation or the damming of rivers. 


12.2. Resistance to Antimicrobial Drugs 


A microbiologist inoculates a petri plate with bacteria. After a day or so, there 
are millions of bacterial cells growing on the petri plate and chances are at least one of 
those cells has a mutation that makes it resistant to some type of antibiotic. However, 
the mutated gene is uncommon in the population and will remain so provided the 
bacteria are grown in an antibiotic-free petri plate. Change the environment by adding 
antibiotics, and the mutated gene increases in frequency. After several generations in 
this antibiotic-filled environment, virtually all bacteria in the population will have the 
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mutation (See Textbox: Tuberculosis Makes a Come Back). This simple scenario, an 
example of evolution by means of natural selection, is played out daily in Canadian 
hospitals. 


Tuberculosis Makes a Come Back 


In May of 2007, the Centers for Disease Control and Prevention (CDC) ordered the 
quarantine of a patient with tuberculosis (TB) for the first time since 1963 (Wilson, 2007). The 
patient had taken two international flights, exposing fellow air travellers to his disease. 
Tuberculosis, which is caused by the bacterium Mycobacterium tuberculosis, is one of the oldest 
and deadliest bacterial diseases in the human population. There is evidence of this disease 
claiming lives in ancient Egypt, China, and India. In mid-seventeenth century Europe tuberculosis 
had become the leading cause of death. The “Great White Plague” refers to a European epidemic 
of TB that lasted for more than 200 years (Colston, 1995). However, by the 1960’s, it appeared 
that modern medicine was close to eradicating tuberculosis, just as it was close to eradicating 
smallpox. But while smallpox was indeed successfully eradicated, tuberculosis instead made a 
spectacular comeback. By 1993, the resurgence of tuberculosis was declared a global health 
emergency by the World Health Organization. 


What went wrong in our attempt to eradicate TB? Simply put, we used the wrong tool. 
Smallpox was eradicated via mass vaccination programs. The attempt to eradicate TB was mainly 
via antibiotics, and that gave Mycobacterium tuberculosis the opportunity to evolve antibiotic 
resistance. A vaccination against TB did exist (the BCG vaccine), but it was only 60-80% effective 
in preventing the disease and was administered only in high risk communities or to health care 
workers, not to the general public at large. Today, as many as 3 billion people carry the bacterium 
that causes tuberculosis (Hill, 2006), and many of those patients harbour multidrug-resistant 
strains of the disease, as did the air traveller who found himself under quarantine in 2007. With 
almost 500,000 new cases of TB being diagnosed each year, the idea of eradicating this disease 
now seems far-fetched. However, there is hope on the horizon — new, highly effective 
vaccinations for TB are being developed. 


By necessity, hospitals are antibiotic-filled environments. Surgical equipment 
and surfaces are sterilized with antibacterial solutions; patients are given antibiotics 
post-surgery in case bacteria from the patient’s skin were inadvertently introduced into 
the site of the surgical wound. Antibiotics are everywhere and thus, non-resistant 
bacteria have very little chance of surviving in a Canadian hospital. However, hospitals 
are the perfect environment for antibiotic resistant bacteria and it was within hospitals 
that the first ‘superbug’, methicillin-resistant Staphylococcus aureus (MRSA), was 
discovered. 


MRSA is one of the most common nosocomial (hospital-acquired) infections. 
Despite a national surveillance program and efforts to reduce such infections, rates of 
MRSA increased by 10-fold in Canadian hospitals between 1995 and 2003 (Zoutman et 
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al., 2003). In fact, nosocomial infections have become the fourth leading cause of death 
among Canadians (Winnipeg Regional Health Authority, 2010). Can an understanding 
of evolution help us solve this problem? Many scientists believe it can. 


The key lies in changing the environment to one of few antibiotics. In the 
absence of antibiotics, the non-resistant strains typically out-compete the mutated, 
resistant bacteria. Simple changes, such as cleaning hospital surfaces with microfiber 
cloths and sterilizing surgical equipment via autoclaving (high pressure steam) can 
greatly reduce antibiotic use at hospitals. Outside hospitals, antibiotic use is highest in 
agriculture. Poultry farms, for example, have such dense flocks of chickens or turkeys 
that antibiotics are added to the poultry feed to prevent disease spread. Unfortunately, 
such practices greatly add to the problem of antibiotic-resistant bacteria. The medical 
community asserts that unless the agricultural industry abandons the practice of 
routinely giving healthy animals antibiotics, we will have no hope in reversing 
antibiotic resistance. However, bacteria are not the only microbes that can evolve drug 
resistance. Given the right conditions (i.e., frequent exposure) fungi evolve resistance 
against antifungal medications, viruses evolve resistance against antivirals, and 
Plamodium (the protozoan that causes malaria) evolves resistance against anti-malarial 
drugs. 


12.3. Zoonotic Diseases 


What do SARS, HIV / AIDS, lyme disease, and avian flu have in common? They 
are all examples of zoonoses, diseases that can be transferred between animals and 
humans. Approximately 75% of all emerging infectious diseases are zoonotic, and 
zoonoses are twice as likely to “emerge” than diseases that are specific to humans 
(Taylor et al., 2001). 


Humans are most likely to be exposed to zoonoses via domestic animals (See 
Textbox: Maybe chicken coops aren’t a great idea). For example, humans living in close 
contact with domestic swine or fowl are a major factor in influenza cycles. Let’s say a 
strain of influenza is circulating in the human population. A farmer contracts the flu 
and passes the virus on to one of his domestic animals. The influenza virus must now 
adapt to the pigs’ or chickens’ physiology and does so through mutation and 
reassortment. The domestic animal later passes the flu back to the farmer. Again, the 
virus must adapt to its new host, in this case to human physiology. By the time the 
influenza virus has completed the cycle back into the human population, it has changed 
(evolved) so much that it has become a new strain of the disease. People who had 
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contracted the flu last year have no immunity to this new version of the virus and can 
get the flu all over again. 


Maybe Backyard Chicken Coops aren’t such a Great Idea 


Canadian urbanites are growing their own food. In most cases this involves vegetable gardens 
and a few fruit-bearing shrubs or trees; more dedicated urban farmers are converting their lawns into 
mini-wheat fields (Stueck, 2011). However, the most controversial trend is the backyard chicken coop 
in which one or several laying hens are kept for eggs. More than 300 cities in North America have now 
changed their bylaws to accommodate urban chickens. The controversy mainly revolves around noise 
issues and most jurisdictions do not allow homeowners to keep roosters. But could backyard hens put 
their owners at risk of disease? 


Anyone who has ever kept a domestic bird (pet or otherwise) will be aware of how frequently 
birds pass feces each day, and it is the feces that can be of particular risk to humans. Zoonotic 
diseases are spread through avian feces (CDC, 2010c). Preventing a build-up of feces is essential in 
avoiding disease transfer, and this means cleaning the chicken coop frequently. Many of the diseases, 
including cryptococcal meningitis (Staples, 2011), Newcastle disease, and aspergillosis affect the 
respiratory system, so wearing a face mask and gloves while cleaning may be prudent (University of 
Arizona, 2002). 


Wild animals can also transmit zoonoses to the human population. The virus 
that causes SARS-Cov-1 (severe acute respiratory syndrome), for example, is very 
similar to a virus isolated in the Masked Palm Civet (Paguma larvata), a carnivorous cat- 
like mammal. Live civets often find themselves in the meat markets of Asia, and it is 
here that scientists hypothesize the virus made the jump from civets to humans. It is 
believed that the current SARS-Cov-2 (Coronavirus) also originated from an infected 
animal from a meat market. The natural reservoir of Hantavirus is the Deer Mouse 
(Peromyscus maniculatus), one of the most common native rodents in Alberta. 
Accumulation of Deer Mouse droppings in barns can result in humans contracting this 
often-fatal respiratory illness. HIV-1 and HIV-2 are similar to the Simian 
immunodeficiency virus isolated from chimpanzees (Pan troglodytes) and sooty 
mangabeys (Cercocebus atys), respectively. It is believed that the virus made the jump 
from these primates to humans when bushmeat hunters came into contact with their 
preys’ blood. 


Other diseases are considered zoonoses because they are spread via an animal 
vector. Mosquitoes are especially effective vectors, spreading the protozoan that causes 
malaria, as well as many viral diseases including West Nile virus, Dengue fever, Yellow 
fever, and various types of encephalitis. Bacterial diseases, such as Lyme disease and 
Rocky Mountain spotted fever, tend to be spread by ticks or lice, rather than 
mosquitoes. 
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12.4. Large-scale Changes to the Environment 


Large-scale changes to the environment are a major contributor to emerging 
infectious diseases (Sutherst, 2004). Typically, these changes either enable humans to 
encroach into the habitat of wild animals (and the zoonotic diseases they may carry) or 
these changes enable diseases to expand their range of distribution (See Textbox: The 
Unintended Consequence of Foreign Aid). 


The Unintended Consequences of Foreign Aid Projects 


The Aswan High Dam in Egypt was constructed on the Nile River during the 1960’s, with 
much of the funding provided as foreign aid from the former Soviet Union. The dam was greatly 
needed. Another dam (the Aswan Low Dam) downstream had nearly been breached in 1946 and 
chances were it would breach in the future if measures were not taken. The Aswan High Dam took a 
decade to build and fill the reservoir behind it. It stands more than 100 m high and the reservoir 
holds more than 100 cubic kilometres of water. 


Egypt has benefitted greatly from the dam. It provides much needed hydroelectricity to its 
growing population, irrigation for agriculture, and protection from the frequent droughts that plague so 
many African nations. However, an unintended consequence of the Aswan High Dam was an 
increase in the incidence of schistosomiasis, a parasitic infection caused by trematode flatworms. 
These parasites have a complex life cycle. Eggs are released into the environment and hatch upon 
contact with freshwater. These larvae, called miricidia, infect freshwater snails and reproduce 
asexually resulting in another type of larvae called cercariae. The cercariae infect humans, migrate 
to the liver via the bloodstream, and reproduce sexually. Fertilized eggs are then released in the 
feces of infected people and the cycle begins again. 


Schistosomiasis is endemic to Egypt, so this was not a new disease for this country. What 
was surprising was the increase in incidence. Health officials discovered that it was the irrigation 
canals that divert freshwater from the Aswan High Dam reservoir to agricultural lands that were 
responsible. The irrigation canals were not only the perfect breeding grounds for the freshwater 
snails, but were also a favourite place for local children to swim on hot days. In the past decade, a 
better understanding of the schistosomiasis life cycle and an aggressive public health education 
program has significantly reduced the rate of infection. 


One of the best-documented cases of environmental change leading to disease 
emergence is Lyme disease. Lyme disease is caused by the bacterium Borrelia burgdorferi 
and is spread by deer ticks (Ixodidea spp.; Hill, 2006). These ticks feed on mammalian 
blood and the bacterium is spread when an infected tick bites its host. The deer tick life 
cycle typically alternates between rodent hosts (larvae and nymphs) and deer hosts 
(adults; Figure 37). However, nymphs can feed on humans as well as rodents. Starting 
in the 1970’s, large numbers of people left the urban centres of the eastern seaboard 
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states to live on acreage properties and in small towns surrounded by forest. In doing 
so, they encroached into the habitat of deer, mice, and ticks and inadvertently became 
an alternate tick nymph host. The disease is named after the town of Lyme, 
Connecticut, the place where Lyme disease first emerged (Hill, 2006). 
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Figure 37. Deer tick nymphs prefer to feed on rodents and small mammals, but will feed on 
humans if they have the opportunity. Deer ticks may have a life cycle that involves one, 
two (as depicted) or three hosts. Deer ticks can carry the bacterium that causes Lyme 
disease. 


If left untreated, Lyme disease can become a chronic, debilitating disease with 
arthritis-like symptoms. Luckily, this condition can be avoided if someone receives a 
course of antibiotics soon after being infected with the bacterium. Because deer ticks 
are so small, it is likely that you won’t notice them on your body. So how would you 
know if you’ve been infected with Lyme disease? You will have a mild fever and a 
characteristic rash in the shape of a “bull’s eye”. While the rash is still present, there is 
time to receive treatment and be cured of the disease. 
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Conversion of natural habitat to agricultural land is also associated with 
emerging infectious diseases. Agriculture tends to increase rodent populations because 
the crops provide additional food for the rats and mice. This is what happened in the 
emergence of Venezuelan Hemorrhagic Fever (VHF). This disease is caused by a virus 
and is spread to humans who come into contact with infected rodent feces, such as in a 
barn or grain storage facility. VHF emerged in Venezuela’s Guanarito region during the 
1980's following massive deforestation and the establishment of agriculture in the 
region. 


Rice fields, in particular, are havens for 
merging infectious diseases. Rice is the most 
important food grain in the world and the most 
common method for growing it is to flood the fields in 
which it is planted. This is done for several reasons: 
rice requires a large quantity of water in order to 
grow, and flooding reduces the numbers of weeds and 
rodents in those fields. However, even though this technique reduces rodent numbers, 
the flooded fields become breeding grounds for mosquitoes that spread diseases such as 
malaria and Japanese encephalitis. 


Finally, changes in global climate patterns can also contribute to emerging 
diseases. In particular, changes in precipitation patterns and temperature can alter the 
geographic distribution of certain disease vectors. For example, climate change has 
been implicated in both the range expansion of Lyme disease and the emergence of 
West Nile virus in Canada (Ogden et al., 2009; Wang et al., 2011). 
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Conclusion 


The world that we live in, both physical and biological, has been shaped by 
natural disasters and infectious diseases. Although at first glance, these two topics 
might seem unrelated, recent examples show their interconnectivity and influence on 
human society. For example, the 2010 earthquake and tsunami that hit Haiti was 
followed by a cholera outbreak. In a country that had long eradicated this infectious 
disease, it was the initial earthquake that damaged infrastructure (and access to health 
care), the tsunami that spread unsanitary water, and the introduction of foreign aid 
workers carrying diseases, that caused such devastation to this country. 


As much as this was a horrific event, and continues to be so, it is our scientific 
knowledge and numerical capabilities that enable society to overcome such disasters. 
We now have a better ability to predict future events, to be able to effectively respond to 
and learn from such disasters and disease outbreak. Our history and current society has 
been shaped by our past successes and failures but at the heart of these issues is our 
ever changing and growing knowledge of the world around us. 
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